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Preface 



A Book on C conveys an appreciation for both the elegant simplicity and the power of 
this general-purpose programming language. By presenting interactive running pro¬ 
grams from many application areas, this book describes the ANSI version of the C lan¬ 
guage. The complete language is presented in a step-by-step manner, along with many 
complete working programs. 

Where appropriate, we discuss the differences between traditional C and ANSI C. 
(Traditional C still remains in wide use.) Dozens of example programs are available to 
illustrate each important language feature, and many tables summarize key informa¬ 
tion and provide easy access for later reference. Each chapter ends with a summary and 
exercises. The summary reviews key elements presented in the chapter, and the exer¬ 
cises augment and extend the text. 

This book assumes a general-purpose knowledge of the C language. It is intended for 
use in either a first or second programming course. However, it can be readily used in 
conjunction with courses on topics such as comparative programming languages, com¬ 
putational linguistics, data structures, database systems, fractal geometry, graphics, 
numerical analysis, operating systems, programming methodology, and scientific appli¬ 
cations. C is suitable for applications from each of these domains, and all features of C 
needed to code such applications are explained. This book is appropriate for a data 
structures course because advanced data structuring features such as enumeration 
types, unions, self-referential structures, and ragged arrays are discussed. For operat¬ 
ing systems courses concerned with UNIX or Windows 95/NT, the book explores the file 
structure and systems routines that enable the C programmer to add to existing sys¬ 
tems libraries and understand the C code underlying the operating system. For applica¬ 
tions programming and scientific programming, there is discussion of how to write 
sample function libraries. Statistics, root finding, sorting, text manipulation, file han¬ 
dling, and game playing are all represented with working code. 
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Flexible Organization. This book is constructed to be very flexible in its use. Chapter 1, 
“An Overview of C,” is in two parts. The first part explains the crucial programming 
techniques needed for interactive input/output, material that must be understood by 
all. The second part of Chapter 1 goes on to survey the entire language and will be com¬ 
prehensible to experienced programmers familiar with comparable features from other 
languages. This second part can be postponed in a first programming course. Caution: 
Beginning programmers should postpone the second part of Chapter 1. 

Chapter 2, “Lexical Elements, Operators, and the C System,” describes the lexical level 
of the language and syntactic rules, which are selectively employed to illustrate C. lan¬ 
guage constructs. The instructor may decide to teach Backus-Naur-Form (BNP) notation 
as described in Chapter 2 or may omit it without any loss of continuity. The book uses 
BNF-style syntactic descriptions so that the student can learn this standard form of 
programming language description. In addition, language components are thoroughly 
described by example and ordinary explanation. 

Reference Work. This book is designed to be a valuable reference to the C language. 
Throughout the book, many tables concisely illustrate key areas of the language. The 
complete ANSI C standard library, along with its associated header files, is described in 
the Appendix A, “The Standard Library.” Sections in the appendix are devoted to 
explaining each of the standard header files such as ctype.h, stdio.h, and string.h. Where 
appropriate, example code is given to illustrate, the use of a particular construct or 
function. 

In Appendix B, “Language Syntax," we provide the complete syntax of the C language. 
In Appendix C, “ANSI C Compared to Traditional C,” we list the major differences 
between ANSI C and traditional C. Finally, special care has been taken to make the index 
easy to use and suitable for a reference work. 

The Complete ANSI C Language, Chapters 3 through 10 cover the C language feature 
by feature. Many advanced topics are discussed that may be omitted on first reading 
without loss of comprehension, if so desired. For example, enumeration types are rela¬ 
tively new to the language, and their use can be omitted in a first course. Machine- 
dependent features such as word size considerations and floating-point representation 
are emphasized, but many of the details need not concern the beginner. 

The Preprocessor. Chapter 8, “The Preprocessor,” is devoted entirely to the preproces¬ 
sor, which is used to extend the power and notation of the C language. Macros can be 
used to generate inline code that takes the place of a function call. Their use can reduce 
program execution time. The chapter presents a detailed discussion of the preproces¬ 
sor, including new features added by the ANSI committee. In traditional C, the prepro¬ 
cessor varies considerably from one compiler to another. In ANSI C, the functionality of 
the preprocessor has been completely specified. 
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Zero is the natural starting point in the C programming language. C counts from 0. C 
uses 0 to mean false and not 0 to mean true. C array subscripts have 0 as a lower 
bound. C strings use 0 as an end-of-string sentinel. C pointers use 0 to designate a null 
value. C external and static variables are initialized to 0 by default. This book explains 
these ideas and initiates you into the pleasures of programming in C. 

C is a general-purpose programming language that was originally designed by Dennis 
Ritchie of Bell Laboratories and implemented there on a PDP-11 in 1972. It was first 
used as the systems language for the UNIX operating system. Ken Thompson, the devel¬ 
oper of UNIX, had been using both an assembler and a language named B to produce 
initial versions of UNIX in 1970. C was invented to overcome the limitations of B. 

B was a programming language based on BCPL, a language developed by Martin Rich¬ 
ards in 1967 as a typeless systems programming language. Its basic data type was the 
machine word, and it made heavy use of pointers and address arithmetic. This is con¬ 
trary to the spirit of structured programming, which is characterized by the use of 
strongly typed languages, such as the ALGOL-like languages. C evolved from B and 
BCPL, and it incorporated typing. 

By the early 1980s, the original C language had evolved into what is now known as 
traditional C by adding the void type, enumeration types, and some other improve¬ 
ments. In the late 1980s, the American National Standards Institute (ANSI) Committee 
X3J11 created draft standards for what is known as ANSI C or standard C. The commit¬ 
tee added the voi d * type, function prototypes, a new function definition syntax, and 
more functionality for the preprocessor, and in general made the language definition 
more precise. Today, ANSI C is a mature, general-purpose language that is widely avail¬ 
able on many machines and in many operating systems. It is one of the chief industrial 
programming languages of the world, and it is commonly found in colleges and univer¬ 
sities everywhere. Also, ANSI C is the foundation for C++, a programming language that 
incorporates object-oriented constructs. This book describes the ANSI version of the C 
language, along with some topics in C++ and Java. 
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C is not without criticism. It has a complicated syntax. It has no automatic array 
bounds checking. It makes multiple use of such symbols as * and =. For example, a 
common programming error is to use the operator = in place of the operator ==. Never¬ 
theless, C is an elegant language. It places no straitjacket on the programmer’s access 
to the machine. Its imperfections are easier to live with than a perfected restrictiveness. 

C is appealing because of its powerful operators and its unfettered nature. A C pro¬ 
grammer strives for functional modularity and effective minimalism. A C programmer 
welcomes experimentation and interaction. Indeed, experimentation and interaction are 
the hallmarks of this book. 


0.2 ANSI C Standard 

The acronym ANSI stands for “American National Standards Institute.” This institute is 
involved in setting standards for many kinds of systems, including programming lan¬ 
guages. In particular, ANSI Committee X3J11 is responsible for setting the standard for 
the programming language C. In the late 1980s, the committee created draft standards 
for what is known as ANSI C or standard C. By 1990, the committee had finished its 
work, and the International Organization for Standardization (ISO) approved the stan¬ 
dard for ANSI C as well. Thus, ANSI C, or ANSI/ISO C, is an internationally recognized 
standard. 

The standard specifies the form of programs written in C and establishes how these 
programs are to be interpreted. The purpose of the standard is to promote portability, 
reliability, maintainability, and efficient execution of C language programs on a variety 
of machines. Almost all C compilers now follow the ANSI C standard. 


0.3 From C to C++ 

Today, C is widely available on PCs, workstations, and mainframes throughout the 
world. At the same time, machines and operating systems continue to evolve. To 
expand the C language or to restrain the use of its constructs in order to conform to a 
particular discipline is not in the spirit of C. 

Although the C language itself is not being expanded, it often serves as the kernel for 
more advanced or more specialized languages. Concurrent C extends the language by 
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Chapter 1 


An Overview of C 


This chapter gives an overview of the C programming language. A series of programs is 
presented, and the elements of each program are carefully explained. Experimentation 
and interaction are emphasized throughout the text. In tills chapter, we emphasize how 
to use the basic input/output functions of C. Note carefully that all our C code also 
serves as C++ code and that all the ideas we discuss hold for C++ as well as for C. Of 
course, the C++ programmer has available a richer set of tools and techniques out of 
which programs can be constructed. (See Chapter 13, “Moving from C to C++.”) 

Except for Section 1.8, “Arrays, Strings, and Pointers,” on page 36, everyone should 
read all the material in this chapter. Any reader who has had experience with arrays, 
pointers, and files in some other language can read all the sections of this chapter to 
get a more complete overview of C. Others can come back to the material when they 
feel they are ready. Everyone should read this chapter with the understanding that tech¬ 
nical details and further explanations will come in later chapters. 


1.1 Programming and Preparation 

Resident on the machine is a collection of special programs called the operating system. 
Commonly available operating systems include MS-DOS, OS/2, and UNIX. An operating 
system manages machine resources, provides software for the user, and acts as an 
interface between the user and the hardware. Among the many software packages pro¬ 
vided by the operating system are the C compiler and various text editors. The principal 
text editor on the UNIX system is called vt. Some systems, such as Borland C++, inte¬ 
grate the text editor and the compiler. We assume that the reader is able to use some 
text editor to create files containing C code. Such files are called source files, and they 
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are compiled on most UNIX systems with the cc command, which invokes the C com¬ 
piler. Because the cc command invokes the compiler, the name of the command is also 
t e name of the compiler. Thus, C compiler and cc compiler are used interchangeably. 
Roughly speaking, a compiler translates source code to object code that is executable. 
" systems > this compiled code is automatically created in a file named a.out On 

m as rhp^-i’ C °™ pi | ed code is automatically created in a file with the same 
. as the .c file, but with the .exe extension replacing the .c extension. At the end of 

presentTriet^tl GCtl0n 1-1 °’ “ 0peratin § System Considerations,” on page 53, we 
p escnt in detail the steps necessary to edit, compile, and execute a program. 


1.2 Program Output 

onX^creTrthp°nn mUn i‘ C f ate t0 be USeM : 0ur first exampIe is a Program that prints 

the phrase from sea to shining C.” The complete program is 

In file sea.c 

#inc1ude <stdio.h> 

int main(void) 

{ 

printf("from sea to shininq C\n")- 
return 0: ’ 

} 

name W * tyP6 tWs int0 a file whose name ends in - c - The choice of a file 

sea To mmn tr em0mC - Let US suppose the Program has been written in the file 
sea.c, to compile the program, we give the command 

cc sea.c 

If there are no errors in the code, the executable file a.out is created by this command 
Now the command ™ I1U ' 

a.out 

executes the program and prints on the screen 


from sea to shining C 


1.2 ▼ Program Output 






MM 




Dissection of the sea Program 

m #inc1ude <stdio.h> 

A preprocessor is built into the C compiler. When the command to compile a program 
is given, the code is first preprocessed and then compiled. Lines that begin with a # 
communicate with the preprocessor. This #i ncl ude line causes the preprocessor to 
include a copy of the header file stdio.h at this point in the code. This header file is pro¬ 
vided by the C system. The angle brackets around <stdi o. h> indicate that the file is to 
be found in the usual place, which is system-dependent. We have included this file 
because it contains information about the pri ntf () function. 

■ int main(void) 

This is the first line of the function definition for mai n(). (We write parentheses after 
the name mai n to remind the reader that mai n() is a function.) The two words i nt and 
voi d are keywords, also called reserved words. They have special meaning to the com¬ 
piler. In Section 2.4, “Keywords,” on page 77, we wall see that there are 32 keywords in 
C, including i nt and voi d. 

■ int main(void) 

{ 


Every program has a function named mai n (). Program execution always starts with this 
function. The top line should be read as “mai n() is a function that takes no arguments 
and returns an i nt value.” Here, the keyword i nt tells the compiler that this function 
returns a value of type i nt. The word i nt stands for integer, but the word integer itself 
cannot be used. The parentheses following mai n indicate to the compiler that mai n is a 
function. This idea is confusing at first because what you see following mai n is (voi d), 
but only the parentheses () constitute an operator telling the compiler that mai n is a 
function. The keyword voi d indicates to the compiler that this function takes no argu¬ 
ments. When we write about functions such as mai n() and pri ntf (), we usually follow 
the name in print with parentheses. This indicates to the reader that we are discussing 
a function. (Many programming books follow this practice.) 


Braces surround the body of a function definition. They are also used to group state¬ 
ments together. 
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il printfC) 

The C system contains a standard library of functions that can be used in programs. 
This is a function from the library that prints on the screen. We included the header file 
stdio.h because it provides certain information to the compiler about the function 
printfC). (See exercise 14, on page 63.) 

a "from sea to shining C\n" 

A string constant in C is a series of characters surrounded by double quotes. This string 
is an argument to the function pri ntf (), and it controls what gets printed. The two 
characters \n at the end of the string (read backslash n) represent a single character 
called newline. It is a nonprinting character. It advances the cursor on the screen to the 
beginning of the next line. 

a printf("from sea to shining C\n") 

This is a call to the pri ntf () function. In a program, the name of a function followed 
by parentheses causes the function to be called, or invoked. If appropriate, the paren¬ 
theses may contain arguments. Here, when the pri ntf () function is invoked, it prints 
its argument, a string constant, on the screen. 

a printf("from sea to shining C\n"); 

This is a statement. Many statements in C end with a semicolon. 

■ return 0; 

This is a return statement. It causes the value zero to be returned to the operating sys¬ 
tem, which in turn may use the value in some way, but is not required to do so. (See Sec¬ 
tion 12.7, “Returning the Status,” on page 579, for further discussion.) Our use of this 
return statement keeps the compiler happy. If we do not use it, the compiler will com¬ 
plain. (See exercise 4, on page 60.) One of the principal rules of programming is "keep 
your compiler happy." 

B } 

The right brace matches the left brace above, ending the function definition for mai n (). 

Hi 
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The function pri ntf () acts to print continuously across the screen. It moves the 
cursor to the start of a new line when a newline character is read. The screen is a 
two-dimensional display that prints from left to right and top to bottom. lo be read¬ 
able, output must appear properly spaced on the screen. 

We can rewrite our first program as follows: 

#include <stdio.h> 

int main(void) 

{ 

pri ntf("from sea to ); 
printf("shining C"); 
pri ntf ("\n"); 
return 0; 

} 

Although it is different from the first version, it will produce the same output. Each 
time pri ntf () is called, printing begins at the position where the previous call to 
pri ntf () left off. If we want to print our phrase on three lines, we can use newline 

characters. 

#include <stdio.h> 


int main(void) 

{ 

printf("from sea\n"); 
printf("to shining\nC\n"); 
return 0; 

} 

When executed, this program will print 


from sea 
to shining 
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characters, is significant and when ithT character including blanks and newline 
proportions. ' S executed - u will give some sense of the screen 

In file sea2.c 

^include <stdio.h> 
int main(void) 

JJrJ^^"\ nXn \ n \ n \ n \n\n\n\n\n"); 

p p 4 n x: ^ n ******* * * * * * * * * * * * * * * **\fl") 1 

pn'ntf(" * r 

tin*ntfto shining C ^ \ n r 1 1 * 

^ ****A*A*AAA***A**A*****\ i,^ J 

Su"Jn C 0^ XnV " X " XnXnX "^ n \""^ 



1.3 


Variables, Expressions, and Assignment 


ImelerftoESC« “““In ^,7“°' , 1 T“ h “ to 

numbers are integers To conven t 1 1 “^ ,0 b ° 26 1111168 and 383 y^ 8 - These 
factor 1.609, a real nLberT^^f *° We ^ the conversion 

reals. To convert yards to miles we T** differentl y from 

to represent this number as a mil rathe^ than as ^ * iS eSSentM 

valuel^^C^all variabkTmu^b^declared^o ^ ° f I 0 ™ 8 integer values ^ d «al 
A variable name, also called an identify^oL^Tof a'Ur" '***?? ° f ** program ' 

underscores, but may not start with a diStZ,2? f Tlf 01 letters - digits - and 
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In file marathon, c 

/* The distance of a marathon in kilometers. */ 

#include <stdio.h> 

int main(void) 

{ 

int miles, yards; 
float kilometers; 

miles = 26; 
yards = 385; 

kilometers = 1.609 * (miles + yards / 1760.0); 
printf("\nA marathon is %f kilometers.\n\n", kilometers) 
return 0; 


The output of the program is 


A marathon is 42.185970 kilometers 




Dissection of the marathon Program 

■ /* The distance of a marathon in kilometers. */ 

Anything written between the characters /* and */ is a comment and is ignored by the 
compiler. All programs in this book that start with a comment are listed in the index. 


int 


miles, yards; 


This is a declaration. Declarations and statements end with a semicolon, i nt is a key¬ 
word and is one of the fundamental types of the language. It informs the compiler that 
the variables following it are of type i nt and are to take on integer values. Thus, the 
variables mi 1 es and yards in this program are of type i nt. 


■ float kilometers; 

This is a declaration, float is a keyword and is one of the fundamental types of the 
language. It in forms the compiler that the variables following it are of type f 1 oat and 
are to take on real values. Thus, the variable ki lometers in this program is of type 

Wm* 

t I oat. 
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a miles = 26; 
yards = 385; 

These are assignment statements. The equal sign is an assignment opeiator. The two 
numbers 26 and 385 are integer constants. The value 26 is assigned to the variable 
miles. The value 385 is assigned to the variable yards. 

■ kilometers = 1.609 * (miles + yards / 1760.0), 

This is an assignment statement. The value of the expression on the right side of the 
equal sign is assigned to the variable ki 1 ometers. The operators *, +, and / stand oi 
multiplication, addition, and division, respectively. Operations inside parentheses are 
performed first. Because division has higher precedence than addition, the value of the 

subexpression 

yards / 1760.0 

is calculated first. (See Appendix E, “Operator Precedence and Associativity.”) That 
value is added to the value of the variable mi 1 es to produce a value that is then mul l- 
plied by 1.609. This final value is then assigned to the variable kilometers. 

■ printf ("\nA marathon is %f kilometers.\n\n , kilometers), 

This is a statement that invokes, or calls, the printf() function. The function 
printf () can have a variable number of arguments. The first argument is always a 
string, called the control string. The control string in this example is 


H \nA marathon is %f kilometers.\n\n f 


It is the first argument to the function printf(). Inside this string is the conversion 
specification, or format, %f. The formats in a control string, if any, are matched with 
the remaining arguments in the pri ntf () function. In this case, %f is matched with the 
argument kilometers. Its effect is to print the value of the variable kilometers as a 
floating-point number and insert it into the print stream where the format %f occurs. 



Certain words, called keywords are reserved and cannot be used by the programmer 
as names of variables. For example, i nt, f 1 oat, and doubl e are keywords. A table of 
keywords appears in Section 2.4, “Keywords,” on page 77. Other names are known o 
the C system and normally would not be redefined by the programmer. The name 
printf is an example. Because printf is the name of a function in the slant arc 

library, it usually is not used as the name of a variable. 
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1.4 ▼ The Use of #define and #include 




A decimal point in a number indicates that it is a floating-point constant rather than 
an integer constant. Thus, the numbers 37 and 37.0 would be treated differently in a 
program. Although there are three floating types—float, double, and long double— 
and variables can be declared to be of any of these types, floating constants are auto¬ 
matically of type doubl e. 

Expressions typically are found on the right side of assignment operators and as 
arguments to functions. The simplest expressions are just constants, such as 385 and 
1760.0, which were used in the previous program. The name of a variable itself can be 
considered an expression, and meaningful combinations of operators with variables 
and constants are also expressions. 

The evaluation of expressions can involve conversion rules. This is an important 
point. The division of two integers results in an integer value, and any remainder is dis¬ 
carded. Thus, for example, the expression 7/2 has int value 3. The expression 7.0/2, 
however, is a double divided by an int. When the expression 7.0/2 is evaluated, the 
value of the expression 2 is automatically converted to a doubl e, causing 7.0/2 to have 
the value 3.5. In the previous program, suppose that the statement 





msm 


kilometers = 1.609 * (miles + yards / 1760.0); 
is changed to 




m 


kilometers = 1.609 * (miles + yards / 1760); 




rm 9 

m; 

- ?! 

.HI 
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!9This leads to a program bug. Because the variable yards is of type int and has value 
j| 385 , the integer expression 
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Ifcuses integer division, and the result is the i nt value 0. This is not what is wanted. Use 
Bliof the constant 1760.0, which is of type doubl e, corrects the bug. 
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The Use of #define and #incl ude 
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he C compiler has a preprocessor built into it. Lines that begin with a # are called pre¬ 
processing directives . If the lines 




#define 
#define 
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PI 


3.14159 
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occur in a file that is being compiled, the preprocessor first changes all occurrences of 
the identifier LIMIT to 100 and all occurrences of the identifier PI to 3.14159, except 
in quoted strings and in comments. The identifiers LIMIT and PI are called symbolic 
constants. A #defi ne line can occur anywhere in a program. It affects only the lines in 
the file that come after it. 

Normally, all #define lines are placed at the beginning of the file. By convention, all 
identifiers that are to be changed by the preprocessor are written in capital letters. The 
contents of quoted strings are never changed by the preprocessor. For example, in the 


printf("PI = %f\n", PI); 

only the second PI will be changed by the above #define directives to the preproces¬ 
sor. The use of symbolic constants in a program make it more readable. More impor¬ 
tantly, if a constant has been defined symbolically by means of the #defi ne facility and 
used throughout a program, it is easy to change it later, if necessary. For example, in 
physics the let ter c is often used to designate the speed of light, which is approximately 
299792.458 km/sec. If we write 

#define C 299792.458 /* speed of light in km/sec */ 

and then use C throughout thousands of lines of code to represent symbolically the 
constant 299792.458, it will be easy to change the code when a new physical experi¬ 
ment produces a better value for the speed of light. All the code is updated by simply 
changing the constant in the #defi ne line. 

In a program, a line such as 

#include "my_file.h" 

is a preprocessing directive that causes a copy of the file my^file.h to be included at this 
point in the file when compilation occurs. A #i ncl ude line can occur anywhere in a file, 
though it is typically at the head of the file. The quotes surrounding the name of the file 
are necessary. An include file, also called a header file, can contain #defi ne lines and 
other #i ncl ude lines. By convention, the names of header files end in A 
The C system provides a number of standard header files. Some examples are stdio.h, 
string.h, and math.h. These files contain the declarations of functions in the standard 
library, macros, structure templates, and other programming elements that are com¬ 
monly used. As we have already seen, the preprocessing directive 

#include <stdio.h> 

causes a copy of the standard header file stdio.h to be included in the code when compi¬ 
lation occurs. In ANSI C, whenever the functions printfQ or scanf Q are used, the 
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standard header file stdio.h should be included. This file contains the declarations, or 
| more specifically, the function prototypes, of these functions. (See Section 1.7, “Func- 
1 tions,” on page 29, for further discussion.) 

The Santa Cruz campus of the University of California overlooks the Monterey Bay on 
| the Pacific Ocean and some of the ocean just to the northwest of the bay. We like to call 
| this part of the ocean that is visible from the campus the “Pacific Sea.” To illustrate how 
| the #i ncl ude facility works, we will write a program that prints the area of the Pacific 
| Sea in various units of measure. First, we create a header file and put in the following 
lines: 




pin file pacific_sea.h 


>5 - 


#include <stdio.h> 




#define 
#define 
#define 
#define 
#define 


AREA 

SQ_MILES_PER_SQ_KILOMETER 

SVFEET_PER_SQ_MILE 

SQ_INCHES_PER_SQ_FOOT 

ACRES_PER_SQ_MILE 


2337 

0.3861021585424458 
(5280 * 5280) 

144 

640 


\ «/ / v> <*,. 

■HINext we write the function mai n() in a .c file. 




Hill In file pacific_sea.c 

fTv' ■■ .; 
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/* Measuring the Pacific Sea. */ 
#include ”pacific_sea.h" 

int main (void) 

{ 

const int pacific_sea = AREA 


double 


/* in sq kilometers */ 


acres, sq_miles, sq_feet, sq_inches; 


Vp-pHt ' 
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printf("\nThe Pacific Sea covers an area"); 
printf(" of %d square kilometers.\n", pacific_sea); 
sq_miles = SQ_MILES_PER_SQ_KILOMETER * pacific_sea; 
sq_feet = SQ_FEET_PER_SQ_MILE * sq_miles; 
sq_inches = SQ_INCHES_PER_SQ_FOOT * sq_feet; 
acres = ACRES_PER_SQ_MILE * sq_miles; 
printf("In other units of measure this is:\n\n"); 
printf("%22.7e acres\n", acres); 
printf("%22.7e square miles\n", sq_niiles); 
printf("%22.7e square feet\n", sq.feet); 
printf("%22.7e square inches\n\n", sq_inches); 
return 0: 
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Now our program is written in two files, a .h file and a .c file. The output of this pro¬ 
gram is 

The Pacific Sea covers an area of 2337 square kilometers. 

In other units of measure this is: 

5.7748528e+05 acres 
9,0232074e+02 square miles 
2.5155259e+10 square feet 
3.6223572e+12 square inches 

The new programming ideas are described in the following dissection table. 




Dissection of the pacific_sea Program 


■ #include "pacific_sea.h" 

This #i ncl ude line is a preprocessing directive. It causes a copy of the file pacific^sea.h 
to be included when the program is compiled. Because this file contains the line 


#include <stdio.h> 


the preprocessor expands the line in turn and includes a copy of the standard header 
file stdio.h in the code as well. We have included stdio.h because we are using pri ntf (). 
Five symbolic constants are defined in pacific_sea.h. 


■ #define AREA 2337 

This #def i ne line is a preprocessing directive. It causes the preprocessor to replace all 
occurrences of the identifier AREA by 2337 in the rest of the file. By convention, capital 
letters are used for identifiers that will be changed by the preprocessor. If at some 
future time a new map is made and a new figure for the area of the Pacific Sea is com¬ 
puted, only this line needs to be changed to update the program. 

■ #define SQ_MILES_PER_SQ_KILOMETER 0.3861021585424458 

The floating constant 0.3861021585424458 is a conversion factor. The use of a sym¬ 
bolic name for the constant makes the program more readable. 
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■ #define SQ_FEET_PER_SQ_MILE (5280 * 5280) 

The preprocessor changes occurrences of the first sequence of characters into the sec¬ 
ond. If a reader of this program knows that there are 5280 feet in a mile, then that 
reader will quickly recognize that this line of code is correct. Instead of (5280 * 
5280), we could have written 27878400; because C compilers expand constant expres¬ 
sions during compilation, run-time efficiency is not lost. Although the parentheses are 
not necessary, it is considered good programming practice to use them. For technical 
reasons parentheses are often needed around symbolic expressions. (See Section 8.3, 
“Macros with Arguments,” on page 368.) 

B const int pacific_sea = AREA; /* in sq kilometers */ 

When compiled, the preprocessor first changes AREA to 2337. The compiler then inter¬ 
prets this line as a declaration of the identifier paci f i c_sea. The variable is declared 
as type i nt and initialized to the value 2337. The keyword const is a type qualifier that 
has been newly introduced by ANSI C. It means that the associated variable can be ini¬ 
tialized, but cannot thereafter have its value changed. (See exercise 18, on page 65.) On 
some systems this means that the variable may be stored in ROM (read-only memory). 

B double acres, sq_miles, sq_feet, sq_inches; 

These variables are defined to be of type double. In ANSI C, floating types are float, 
double, and long double; long double does not exist in traditional C. Each of these 
types is used to store real values. Typically, a f 1 oat will store 6 significant digits and a 
doubl e will store 15 significant digits. Along double will store at least as many signif¬ 
icant digits as a double. (See Section 3.6, “The Floating Types,” on page 119.) 

a printf("%22.7e acres\n", acres); 

Tins statement causes the line 

5,7748528e+05 acres 

to be printed. The number is written in scientific notation and is interpreted to mean 
5.7748528 x 10 s . Numbers written this way are said to be written in an e-format. The 
conversion specification %e causes the system to print a floating expression in an e-for¬ 
mat with default spacing. A format of the form %m.ne, where m and n are positive inte¬ 
gers, causes the system to print a floating expression in an e-format in m spaces total, 
with n digits to the right of the decimal point. (See Section 11.1, “The Output Function 
pri ntf (),” on page 493.) 
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1.5 The Use of printf() and scanf() 

The function printfO is used for output. In an analogous fashion, the function 
scanf () is used for input. (The f in pri ntf and scanf stands for formatted.) Techni¬ 
cally, these functions are not part of the C language, but rather are part of the C system. 
They exist in a library and are available for use wherever a C system resides. Although 
the object code for functions in the library is supplied by the C system, it is the respon¬ 
sibility of the programmer to declare the functions being used. ANSI C has introduced a 
new and improved land of function declaration called a function prototype. This is one 
of the most important changes introduced into the language by ANSI C. The function 
prototypes of functions in the standard library are available in the standard header 
files. In particular, the function prototypes for pri ntf () and scanf () are in stdio.h. 
Thus, this header file should be included whenever the function pri ntf () or scanf () 
is used. (See Section 1.7, “Functions,” on page 29.) 

Both pri ntf () and scanf () are passed a list of arguments that can be thought of as 

controLstring and other^arguments 

where controLstring is a string and may contain conversion specifications, or formats. 
A conversion specification begins with a % character and ends with a conversion charac¬ 
ter. For example, in the format %d the letter d is the conversion character. As we have 
already seen, this format is used to print the value of an integer expression as a decimal 
integer. To print the letters on the screen, we could use the statement 

printf("abc"); 

Another way to do this is with the statement 
printf("%s", "abc"); 

The format %s causes the argument "abc" to be printed in the format of a string. Yet 
another way to do this is with the statement 

printf("%c%c%c", 'a', 'b', 'c'); 

Single quotes are used to designate character constants. Thus, ' a' is the character con¬ 
stant corresponding to the lowercase letter a. The format %c prints the value of an 
expression as a character. Notice that a constant by itself is considered an expression. 
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printfO conversion characters 

Conversion 

character 

How the corresponding argument is printed 

c 

as a character 

d 

as a decimal integer 

e 

as a floating-point number in scientific notation 

f 

as a floating-point number 

g 

in the e-format or f-format, whichever is shorter 

S 

as a string 


When an argument is printed, the place where it is printed is called its field and the 
number of characters in its field is called its field width. The field width can be specified 
in a format as an integer occurring between the % and the conversion character. Thus, 
the statement 

pri ntf("%c%3c%5c\n”, 'A', 'B', 'C); 
will print 

A B C 

The function scanf 0 is analogous to the function pri ntf () but is used for input 
rather than output. Its first argument is a control string having formats that corre¬ 
spond to the various ways the characters in the input stream are to be interpreted. The 
other arguments are addresses. Consider, for example, the statement 

scanf("%d", &x); 

The format %d is matched with the expression &x, causing scanf () to interpret charac¬ 
ters in the input stream as a decimal integer and store the result at the address of x. 
Read the expression &x as “the address of x” because & is the address operator. 

When the keyboard is used to input values into a program, a sequence of characters 
is typed, and it is this sequence of characters, called the input stream, that is received 
by the program. If 1337 is typed, the person typing it may think of it as a decimal inte¬ 
ger, but the program receives it as a sequence of characters. The scanf () function can 
be used to convert a string of decimal digits into an integer value and to store the value 
at an appropriate place in memory. 
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e unction scanf() returns an int value that is the number of successful conver¬ 
sions accomplished or the system defined end-of-value. The function pri ntf () returns 

an i nt va ue that is the number of characters printed or a negative value in case of an 
error. 


scanf () conversion 


Conversion 

character 


If or LF 


How characters in the input stream are converted 

character 
decimal integer 
floating-point number (float) 
floating-point number (double) 
string 


Ihe details concerning printf () and scanf () are found in Section ll.l, “The Output 

on °’“ ° n 493 ’ 3nd ln SeCti0n 1 L2 > “ The Input Function scanf O," 

Of thffma m eie ’ VVe ° n y Want t0 present enough information to get data into and out 

characters"amf m & mmin * ally a( ' a 'ptable way. The following program reads in three 

Char are F a H t S °f e "T md thm prilltS thcm out - Notice that variables of tvpe 
char aie used to store character values. 

In file echo.c 


#include <stdio.h> 


int main(void) 
{ 

char cl 
int i ; 
float x; 
double y; 


cl, c2, c3; 

i * 

* > 


printf("\n%s\n%s", "Input three characters," 
an int, a float, and a double: "); 
scanff %c%c%c%d%f%lf". &cl, &c 2 , &c3, &i &x &y) ■ 
printff \nHere is the data that you typed in An")■ 

pn ntf ( %3c%3c%3c%5d%17e%17e\n\n , cl c 2 c3 i x v) ■ 

return 0- ’ ’ x ’ “b 
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If we compile the program, run it, and type in ABC 3 5 5 77.7, then this is what 
appears on the screen: 

Input three characters, 

an int, a float, and a double: ABC 3 55 77.7 

Here is the data that you typed in: 

ABC 3 5.500000e+01 7.770000e+01 

When reading in numbers, scanf () will skip white space (blanks, newlines, and tabs), 
but when reading in a character, white space is not skipped. Thus, the program wall not 
run correctly with the input AB C 3 55 77.1. The third character read is a blank, 
which is a perfectly good character; but then scanf () attempts to read C as a decimal 
integer, which causes difficulties. 


1.6 Flow of Control 

Statements in a program are normally executed in sequence. However, most programs 
require alteration of the normal sequential flow of control. The i f and i f-el se state¬ 
ments provide alternative actions, and the while and for statements provide looping 
mechanisms. These constructs typically require the evaluation of logical expressions, 
expressions that the programmer thinks of as being either true or false. In C, any non¬ 
zero value is considered to represent true, and any zero value is considered to repre¬ 
sent false. 

The general form of an i f statement is 

if ( expr ) 
statement 

If expr is nonzero (true), then statement is executed; otherwise, it is skipped. It is impor- 
tant to recognize that an i f statement, even though it contains a statement part, is 
itself a single statement. Consider as an example the code 

a T T * 

JL f 

if (b == 3) 
a = 5; 

printf("%d", a); 

The symbols == represent the is equal to operator. In the code above, a test is made to 
see if the value of b is equal to 3. If it is, then a is assigned the value 5 and control 
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passes to the pri ntf () statement, causing 5 to be printed. If, however, the value of b is 
not 3, then the statement 

a = 5; 

is skipped and control passes directly to the pri ntf () statement, causing 1 to be 
printed. In C, logical expressions have either the i nt value 1 or the i nt value 0. Con¬ 
sider the logical expression 


This expression has the i nt value 1 (true) if b has the value 3; otherwise, it has the i nt 
value 0 (false). 

A group of statements surrounded by braces constitutes a compound statement. Syn¬ 
tactically, a compound statement is itself a statement; a compound statement can be 
used anywhere that a statement can be used. The next example uses a compound state¬ 
ment in place of a simple statement to control more than one action: 

if (a == 3) { 
b = 5; 
c = 7; 

} 

Here, if a has value 3, then two assignment statements are executed; if a does not have 
value 3, then the two statements are skipped. 

An i f-el se statement is of the form 

if iexpf) 
statement1 
el se 

statement2 

It is important to recognize that the whole construct, even though it contains state¬ 
ments, is itself a single statement. If expr is nonzero (true), then statement 1 is executed; 
otherwise statement^ is executed. As an example, consider the code 




1.6 ▼ Flow of Control 


23 



This construct is a whi 1 e statement, or whi 1 e loop. The symbols <= represent the less 
than or equal to operator. A test is made to see if i is less than or equal to 5. If it is, the 
group of statements enclosed by the braces { and } is executed, and control is passed 
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back to the beginning of the while loop for the process to start over again. The while 
loop is repeatedly executed until the test fails—that is, until i is not less than or equal 
to 5. When the test fails, control passes to the statement immediately following the 
whi 1 e statement, which in this program is a pri ntf () statement. 

■ sum += i; 

This is a new kind of assignment statement. It causes the stored value of sum to be 
incremented by the value of i. An equivalent statement is 

sum = sum + i; 

The variable sum is assigned the old value of sum plus the value of i. A construct of the 
form 


variable op= expr 

where op is an operator such as +, *, or / is equivalent to 

variable = variable op ( expr ) 

■ ++i ; 

C uses ++ and — to increment and decrement, respectively, the stored values of vari¬ 
ables. The statement 


++i; is equivalent to i = i + 1; 

In a similar fashion, —i will cause the stored value of i to be decremented. (See Section 

2.10, “Increment and Decrement Operators,” on page 85, for further discussion of these 
operators.) 



A hand simulation of the program shows that the whi 1 e loop is executed five times, 
with i taking on the values 1, 2, 3, 4, 5 successively. When control passes beyond the 
while statement, the value of i is 6, and the value of sum is 


i + 2 + 3 + 4 + S which is equal to 15 

This is the value printed by the pri ntf () statement. 
The general form of a whi 1 e statement is 
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while (.expr) 
statement 

where statement is either a simple statement or a compound statement. When the 
while statement is executed, expr is evaluated. If it is nonzero ( true ), then statement is 
executed and control passes back to the beginning of the whi 1 e loop. This process con¬ 
tinues until expr has value 0 (false). At this point, control passes on to the next state¬ 
ment. In C, a logical expression such as i <= 5 has i nt value 1 (true) if i is less than or 
equal to 5, and has i nt value 0 (false) otherwise. 

Another looping construct is the for statement. (See Section 4.9, “The for State¬ 
ment,” on page 167, for a more complete discussion.) It has the form 

for (expr1\ expr2; expr3) 
statement 

If all three expressions are present, then this is equivalent to 
exprl ; 

while (expr2) { 
statement 
expr3 ; 

} 

Typically, exprl performs an initial assignment, expr2 performs a test, and expr3 incre¬ 
ments a stored value. Note that expr3 is the last thing done in the body of the loop. The 
for loop is repeatedly executed as long as expr2 is nonzero (true). For example, 

for (i =1; i <= 5; ++i) 
sum += i; 

This for loop is equivalent to the whi 1 e loop used in the last program. 

Our next program illustrates the use of an if-else statement within a for loop. 
Numbers are read in one after another. On each line of the output we print the count 
and the number, along with the minimum, maximum, sum, and average of all the num¬ 
bers seen up to that point. (See exercise 16, on page 65, through exercise 18, on page 
65, for further discussion concerning the computation of the average.) 
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In file running_sum.c 

/* Compute the minimum, maximum, sum, and average. */ 

#include <stdio.h> 

#include <std1ib.h> 

int main(void) 

{ 

int i; 

double x, min, max, sum, avg; 

if (scant, &x) 1= 1) { 

printf("No data found - bye!\n"); 
exit(l); 

> 

min = max = sum = avg = x; 

printf("%5s%9s%9s%9s%12s%12s\n%5s%9s%9s%9s%12s%12s\n\n", 

"Count", "Item", "Min", "Max", "Sum", "Average", 

r._i. m _„ „_„_„ .._„ fi_ 

printf("%5d%9.lf%9!lf%9.lf%12. 3f%12 .3f\n", 

1, x, min, max, sum, avg); 

for (i = 2; scanf("%lf", &x) == 1; ++i) { 
i f (x < mi n) 
min = x; 

else if (x > max) 
max = x; 
sum += x; 
avg = sum / i; 

printf("%5d%9.lf%9.lf%9.lf%12.3f%12.3f\n", 
i, x, min, max, sum, avg); 

} 

return 0; 

} 

This program has been designed to read numbers from a file. We can type them in from 
the keyboard, but if we do this, then what appears on the screen will not be formatted 
correctly. To test this program, we compile it and put the executable code in 
running_sum. Then we create a file called data and put the following numbers in it: 

3-5 7-9 11 -13 15 -17 19 -21 

Now, when we give the command 


rurm'mg_sum < data 
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the following appears on the screen: 


Count 

Item 

Mi n 

Max 

Sum 

Average 

1 

3.0 

3.0 

3.0 

3.000 

3.000 

2 

-5.0 

-5.0 

3.0 

-2.000 

-1.000 

3 

7.0 

-5.0 

7.0 

5.000 

1.667 


The use of the symbol < in the command 
running_sam < data 

causes the input to be redirected. The program running.sum takes its input from the 
standard input file, which is normally connected to the keyboard. The operating sys¬ 
tem, however, has redirected the input to the file data. In this context, the symbol < is 
thought of as a left pointing arrow. (See Section 1.10, “Operating System Consider¬ 
ations,” on page 53, for further discussion.) 





Dissection of the mnning_sum Program 

a if (scanf("%lf", &x) != 1) { 

printf("No data found - bye!\n"); 
exit(l); 

} 

Recall that scanf () returns as an int the number of successful conversions per¬ 
formed. If scanf () is unable to make a conversion, then we print a message and exit 
the program. The function exi t() is in the standard library, and its function prototype 
is in stdlib.h. When exit() is invoked, certain housekeeping tasks are performed and 
the program is terminated. This function takes a single argument of type int that, by 
convention, is zero if the programmer considers the exit to be normal, and is nonzero 
otherwise. 


pri ntf ("%5s%9s%9s%9s% 


"Count 


ii if 

5 

ll if 

$ 


Item 


!l I 

5 

II I 

$ 


'Min", "Max", "Sum", "Average", 


ti ft 
J 


n ti 

3 


II It 
5 


This statement prints headings. The field widths in the formats have been chosen to 
put headings over columns. 
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a pri ntf("%5d%9. lf%9. lf%9. lf%12.3f%12.3f\n n , 

If x, min, max, sum, avg); 

After the headings, this is the first line to be printed. Notice that the field widths here 
match the field widths in the previous pri ntf() statement. 

S for (i =2; scanf("%1f", &x) == 1; ++i) { 

The variable i is initially assigned the value 2. Then a test is made to see if the logical 
expression 

scanf, &x) == 1 

is true. If scanf () can read characters from the standard input stream, interpret them 
as a doubl e (read 1 f as “long float”), and place the value at the address of x, then a suc¬ 
cessful conversion has been made. This causes scanf () to return the i nt value 1, 
which in turn makes the logical expression true. As long as scanf () can continue to 
read characters and convert them, the body of the for loop will be executed repeatedly. 
The variable i is incremented at the end of the body of the loop. 

a if (x < min) 
min = x; 

else if (x > max) 
max = x; 

This construct is a single i f■-el se statement. Notice that the statement part following 
the el se is itself an i f statement. Each time through the loop this i f-el se statement 
causes the values for mi n and max to be updated, if necessary. 
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1.7 Functions 

The heart and soul of C programming is the function. A function represents a piece of 
code that is a building block in the problem-solving process. All functions are on the 
same external level; they cannot be nested one inside another. A C program consists of 
one or more functions in one or more files. (See Section 5.8, “Developing a Large Pro¬ 
gram,” on page 209.) Precisely one of the functions is a mai n() function, where execu¬ 
tion of the program begins. Other functions are called from within mai n() and from 
within each other. 

Functions should be declared before they are used. Suppose, for example, that we 
want to use the function pow(), called the power function, one of many functions in the 
mathematics library available for use by the programmer. A function call such as 
pow(x, y) returns the value of x raised to the y power. To give an explicit example, 
pow(2 .0, 3.0) yields the value 8.0. The declaration of the function is given by 

double pow(double x, double y); 

Function declarations of this type are called function prototypes. An equivalent function 
prototype is given by 

double pow(double, double); 

Identifiers such as x and y that occur in parameter type lists in function prototypes are 
not used by the compiler. Their purpose is to provide documentation to the program¬ 
mer and other readers of the code. 

A function prototype tells the compiler the number and type of arguments to be 
passed to the function and the type of the value that is to be returned by the function. 

ANSI C has added the concept of function prototype to the C language. This is an 
important change. In traditional C, the function declaration of pow() is given by 

double pow(); /* traditional style */ 

Parameter type lists are not allowed. ANSI C compilers will still accept this style, but 
function prototypes, because they greatly reduce the chance for errors, are much pre¬ 
ferred. (See exercise 5, on page 236, in Chapter 5, “Functions.”) 

A function prototype has the following general form: 

type function._name(jparameter type list) ; 
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The parameter type list is typically a list of types separated by commas. Identifiers are 
optional; they do not affect the prototype. The keyword voi d is used if a function takes 
no arguments. Also, the keyword voi d is used if no value is returned by the function. If 
a function takes a variable number of arguments, then ellipses ... are used. For exam¬ 
ple, the function prototype 

int printf(const char *format, ...); 

can be found in stdio.h. (See exercise 14, on page 63.) This information allows the com¬ 
piler to enforce type compatibility. Arguments are converted to these types as if they 
were following rules of assignment. 

To illustrate the use of functions, we set for ourselves the following task: 

Creating maxmin Program 

1 Print information about the program (this list). 

2 Read an integer value for n. 

3 Read in n real numbers. 

4 Find minimum and maximum values. 

Let us write a program called maxmin that accomplishes the task. It consists of three 
functions written in the file maxmin.c. 

In file maxmin.c 


#include <stdio.h> 

float maxi mum(float x, float y); 
float minimum(float x, float y) ; 
void prn_info(void); 

int main(void) 

{ 

int i, n; 
float max, min, x; 

prn_info() ; 

printf("Input n: "); 

scanf("%d", &n); 

printf("\nlnput %d real numbers: 
scanf("%f", &x); 
max = min = x; 


”, n); 
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for (i =2; i <= n; ++i) { 
scanf("%f", &x); 
max = maximum(max, x); 
min = minimum(min, x); 

} 

pri ntf("\n%s%ll.3f\n%s%ll.3f\n\n", 
"Maximum value:", max, 

"Minimum value:", min); 
return 0; 


float maximum(float x, float y) 

{ 

if (x > y) 
return x; 
el se 

return y; 

} 



float minimum(float x, float y) 

{ 

if (x < y) 
return x; 
el se 

return y; 

} 


void prn_info(void) 

. { 

printf("\n%s\n%s\n\n", 

"This program reads an integer value for n, and then", 
"processes n real numbers to find max and min values."); 

} 

To test the program, we give the command 




maxmin 


T 


Suppose, when prompted, we type in 5 followed by the line 




m I 










737.7799 -11.2e+3 -777 0.001 3.14159 












YjCESjJ'” ->■</£. ; 


Hi i&>: ■ 
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v. 

iwKS. 
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Here is what appears on the screen: 

This program reads an integer value for n, and then 
processes n real numbers to find max and min values. 

Input n: 5 

Input 5 real numbers: 737.7799 -11.2e+3 -777 0.001 3.14159 

Maximum value: 737.780 

Minimum value: -11200.000 




Dissection of the maxmin Proqram 


#include <stdio.h> 

float maximumCfloat x, float y); 
float minimumCfloat x, float y); 
void prn_info(void); 

int main(void) 

{ 


» » ( « • 


The function prototypes for the functions maximumO, minimumO, and prn_info() 

occur at the top of the file after any #i ncl ude lines and #defi ne lines. The first two 

function prototypes tell the compiler that the functions maxi mum () and mi m'mum() 

each take two arguments of type float and each return a value of type float. The 

third function prototype tells the compiler that prn_i nfo() takes no arguments and 

returns no value. Note that for the first two function prototypes we could iust as well 
have written 


float maximumCfloat, float); 
float minimumCfloat, float); 


The compiler does not make any use of parameters such as x and y in function proto¬ 
types. The parameters serve only as documentation for the reader of the code. 
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int mainCvoid) 

{ 

int i, n; 
float max, min, x; 

prn_infoC); 

printfC'lnput n: ") 

scanf("%d", &n); 


Variables are declared at the beginning of mainC). The first executable statement in 
mai n C) is 

prn_infoC); 

This statement invokes the function prn_info(). The function contains a single 
pri ntf C) statement that prints information about the program on the screen. The user 
responds to the prompt by typing characters on the keyboard. We use scanf Q to inter¬ 
pret these characters as a decimal integer and to place the value of this integer at the 
address of n. 




printfC"\nlnput %d real numbers 
scanf("%f", &x); 


", n); 


max 


min = x; 


The user is asked to input n real numbers. The first real number is read in, and its value 
is placed at the address of x. Because the assignment operator associates from right to 
left 


max 


mi n 


is equivalent to max = (mi n = x) ; 


Thus, the value x is assigned first to mi n and then to max. (See Section 2.9, “Precedence 
and Associativity of Operators,” on page 83.) 

■ for Ci =2; i <= n; ++i) { 
scanf ("%f", &x); 
max = maximum(max, x); 
min = mini mum(min, x); 


Each time through the loop a new value for x is read in. Then the current values of max 
and x are passed as arguments to the function maximumO, and the larger of the two 
values is returned and assigned to max. Similarly, the current values of mi n and x are 
passed as arguments to the function mi nimumO, and the smaller of the two values is 
returned and assigned to mi n. In C, arguments to functions are always passed by value. 
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This means that a copy of the value of each argument is made, and it is these copies 
that are processed by the function. The effect is that variables passed as arguments to 
functions are not changed in the calling environment. 

■ float maxi mum(float x, float y) 

{ 

if (x > y) 
return x; 
el se 

return y; 


Ihis is the function definition for the function maxi mum (). It specifies explicitly how 
the function will act when it is called, or invoked. A function definition consists of a 


header and a body. The header is the code that occurs before the first left brace {. The 
body consists of the declarations and statements between the braces { and }. For this 
function definition the header is the line 


float maximum(float x, float y) 

fhe first keyword f 1 oat in the header tells the compiler that this function is to return a 
value of type f 1 oat. The parameter list consists of the comma-separated list of identi¬ 
fier declarations within the parentheses ( and ) that occur in the header to the function 
definition. Here, the parameter list is given by 

float x, float y 

Ihe identifiers x and y are formal parameters. Although we have used the identifiers x 
and y both here and in the function mai n(), there is no need to do so. There is no rela¬ 
tionship, other than a mnemonic one, between the x and y used in maxi mum () and the x 
and y used in mainQ. Parameters in a function definition can be thought of as place¬ 
holders. When expressions are passed as arguments to a function, the values of the 
expressions are associated with these parameters. The values are then manipulated 
according to the code in the body of the function definition. Here, the body of the func¬ 
tion definition consists of a single i f-el se statement. The effect of this statement is to 
return the larger of the two values x and y that are passed in as arguments. 


■ return x; 

This is a return statement. The general form of a return statement is 


return; 


or return expr; 


1.7 Y Functions 






flgggisg m 

s 

®§8if®tf 




A retu rn statement causes control to be passed back to the calling environment. If an 
expression follows the keyword return, then the value of the expression is passed 
back as well. 

■ float minimum(float x, float y) 

{ 

if (x < y) 
return x; 
el se 

return y; 

} 

The function definition for mi nimumO comes next. Note that the header to the function 
definition matches the function prototype that occurs at the top of the file. This is a 
common programming style. 

a void prn_info(void) 


This is the function definition for prn_i nfo(). The first voi d tells the compiler that 
this function returns no value, the second that this function takes no arguments. 



Call-by-Value 

In C, arguments to functions are always passed by value. This means that when an 
expression is passed as an argument to a function, the expression is evaluated, and it is 
this value that is passed to the function. The variables passed as arguments to func¬ 
tions are not changed in the calling environment. Here is a program that illustrates this: 
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In file no_change.c 

#include <stdio.h> 

int main(void) 

{ 

int a = 1 ; 

void t ry_to_change_it(int); 

printf("%d\n", a); /* 1 is printed */ 

try_to_change_.it (a); 

printf("%d\n", a); /* 1 is printed again! */ 

return 0 ; 

} 

void try_to_change_it(int a) 

{ 

a = 777; 

} 

When a is passed as an argument, the expression a is evaluated to produce a value that 
we can think of as a copy of a. It is this copy, rather than a itself, that is passed to the 
function. Hence, in the calling environment the variable a does not get changed. 

This argument-passing convention is known as call-by-value. To change the value of a 
variable in the calling environment, other languages provide call-by-reference. In C, to 
get the effect of call-by-reference, pointers must be used. (See Section 6.3, “Call-by-Ref- 
erence,” on page 252.) 


1.8 Arrays, Strings, and Pointers 

In C, a string is an array of characters, and an array name by itself is a pointer. Because 
of this, the concepts of arrays, strings, and pointers are intimately related. A pointer is 
just an address of an object in memory. C, unlike most languages, provides for pointer 
arithmetic. Because pointer expressions of great utility are possible, pointer arithmetic 
is one of the strong points of the language. 
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Arrays 

Arrays are used when many variables, all of the same type, are desired. For example, 
the declaration 

int a[3]; 

allocates space for the three-element array a. The elements of the array are of type int 
and are accessed as a[0], a [1], and a [2], The index, or subscript, of an array always 
starts at 0. The next program illustrates the use of an array. The program reads in five 
scores, sorts them, and prints them out in order. 

In file scores.c 




pSISSp 


f'l ' ; f 


#include <stdio.h> 

#define CLASS_SIZE 5 

int main(void) 

{ 

int i, 3 , score[CLASS_SIZE], sum = 0 , tmp; 

printf("Input %d scores: ", CLASS_SIZE); 
for (i =0; i < CLASS_SIZE; ++i) { 
scanf("%d", &score[i]); 
sum += score[i]; 

} 




for (i =0; i < CLASS_SIZE 


++i ) 


/* bubble sort */ 


for (j = CLASS_SIZE - 1; j > i; — j) 

if (score[j-1] < score [j]) { /* check the order */ 

tmp = score [j-1]; 
score[j-l] = score [j]; 
score[j] = tmp; 

} 

printf("\n0rdered scores:\n\n"); 
for (i =0; i < CLASS_SIZE; ++i) 

printf(" score[%d] =%5d\n", i, score[i]); 
printf("\n%18d%s\n%18.lf%s\n\n", 

sum, " is the sum of all the scores", 

(double) sum / CLASS_SIZE, " is the class average"); 
return 0; 


score [i]); 


If we execute the program and enter the scores 63, 88, 97, 53, 77 when prompted, we 
will see on the screen 
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Input 5 scores: 63 88 97 53 77 

Ordered scores: 
score [0] = 97 

score[1] = 88 

score[2] = 77 

score[3] = 63 

score[4] = 53 

378 is the sum of all the scores 
75.6 is the class average 

A bubble sort is used in the program to sort the scores. This construction is typically 
done with nested for loops, with a test being made in the body of the inner loop to 
check on the order of a pair of elements. If the elements being compared are out of 
order, their values are interchanged. Here, this interchange is accomplished by the code 

tmp = score[j —1]; 
score tj-1] = score[j]; 
score[j] = tmp; 

In the first statement, the variable tmp is used to temporarily store the value of 
score[j-l]. In the next statement, the value of score[j-l] stored in memory is being 
overwritten with the value of score [j ]. In the last statement, the value of score [ j ] is 
being overwritten with the original value of score [i ], which is now in tmp. Hand simu¬ 
lation of the program with the given data will show 7 the reader why this bubble sort con¬ 
struct of two nested for loops achieves an array with sorted elements. The name 
bubble sort comes from the fact that at each step of the outer loop the desired value 
among those left to be worked over is bubbled into position. Although bubble sorts are 
easy to code, they are relatively inefficient. Other sorting techniques execute much 
faster. This is of no concern when sorting a small number of items infrequently, but if 
the number of items is large or the code is used repeatedly, then efficiency is, indeed, 
an important consideration. The expression 

(double) sum / CLASS_SIZE 

which occurs as an argument in the final pri ntf () statement, uses a cast operator. The 
effect of (doubl e) sum is to cast, or convert, the i nt value of sum to a doubl e. Because 
the precedence of a cast operator is higher than that of the division operator, the cast is 
done before division occurs. (See Section 2.9, “Precedence and Associativity of Opera¬ 
tors,” on page 83.) When a doubl e is divided by an i nt, we have what is called a mixed 
expression. Automatic conversion now takes place. The int is promoted to a double, 
and the result to the operation is a doubl e. If a cast had not been used, then integer 
division would have occurred and any fractional part would have been discarded. More¬ 
over, the result would have been an i nt, which would have caused the format in the 
pri ntf () statement to be in error. 
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Strings 

In C, 3. string is tin array of characteis. In this section, in addition to illustrating the use 
|of strings, we want to introduce the use of getcharQ and putcharQ. These are mac- 
ros defined in stdio.h. Although there are technical differences, a macro is used in the 
same way a function is used. (See Section 8.7, “The Macros in stdio.h and ctype.h,” on 
page 382.) The macros getcharQ and putcharQ are used to read characters from the 
keyboard and to print characters on the screen, respectively. 

Our next program stores a line typed in by the user in an array of characters (a string) 

and then prints the line backwards on the screen. The program illustrates how charac¬ 
ters in C can be treated as small integers. 


JfA In file nice„day.c 


;r... 

t:-..., ' . • 
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/* Have a nice day! */ 

#include <ctype.h> 
finclude <stdio.h> 


#define MAXSTRING 100 




HP 


int main (void) 

{ 

char c, name[MAXSTRING]; 

int i, sum = 0; 





■p 


printf("\nHi! What is your name? "); 
for (i =0; (c = getchar()) != *\n'; ++i) { 
name[i] = c; 
if (isalpha(c)) 
sum += c; 

} 

namefi] = ’\0'; 

printf("\n%s%s%s\n%s", 

"Nice to meet you ", name, 

"Your name spelled backwards is "); 
for (—i; i >= 0; —i) 
putchar(name[i]); 
printf("\n%s%d%s\n\n%s\n", 

"and the letters in your name sum to ", sum 
"Have a nice day!"); ’ 

return 0; 


II If 
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If we ran the program and enter the name Al i ce B, Carol e when prompted, the fol¬ 
lowing appears on the screen: 

Hi! What is your name? Alice B. Carole 
Nice to meet you Alice B. Carole. 

Your name spelled backwards is eloraC .B ecilA 
and the letters in your name sum to 1142. 

Have a nice day! 



Dissection of the nice_day Program 


■ #include <ctype.h> 

#include <stdio.h> 

The standard header file stdio.h contains the function prototype for pri ntf (). It also 
contains the macro definitions for getcharQ and putcharQ, which will be used to 
read characters from the keyboard and to write characters to the screen, respectively. 
The standard header file ctype.h contains the macro definition for i sal pha(), which 
will be used to determine if a character is alphabetic —that is, if it is a lower- or upper¬ 
case letter, 

■ #define MAXSTRING 100 

The symbolic constant MAXSTRING will be used to set the size of the character array 
name. We are making the assumption that the user of this program will not type in more 
than 99 characters. Why 99 characters? Because the system will add the ' \0' as one 
extra guard character terminating the string. 

■ char c, name[MAXSTRING]; 

int i, sum = 0; 

The variable c is of type char. The identifier name is of type array of char, and its size 
is MAXSTRING. In C, all array subscripts start at 0. Thus, name[0], name[l], ... , 
name [MAXSTRING - 1] are the elements of the array. The variables i and sum are of 
type i nt; sum is initialized to 0. 

■ printf("\nHi! What is your name? "); 

This is a prompt to the user. The program now expects a name to be typed in followed 
by a carriage return. 
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■ (c = getcharO) != ’\n’ 

This expression consists of two parts. On the left we have 
(c = getcharO) 

Unlike other languages, assignment in C is an operator. (See Section 2.11, “Assignment 
Operators,” on page 87.) Here, getcharO is being used to read a character from the 
keyboard and to assign it to c. The value of the expression as a whole is the value of 
whatever is assigned to c. Parentheses are necessary because the order of precedence 
of the = operator is less than that of the ! = operator. Thus, 


c = getcharO ! 


1 \ ** f 


is equivalent to c = (getcharO != '\n') 


which is syntactically correct, but not w'hat we want. In Section 2.9, “Precedence and 
Associativity of Operators,” on page 83, we discuss in detail the precedence and asso¬ 
ciativity of operators. 


getcharO) 1= ’\n' j ++i) { 


for (i = 0; (c = g 
name[i] = c; 
if (isalpha(c)) 
sum += c: 


The variable i is initially assigned the value 0. Then getcharO gets a character from 
the keyboard, assigns it to c, and tests to see if it is a newline character. If it is not, the 
body of the for loop is executed. First, the value of c is assigned to the array element 
name[i]. Next, the macro isalphaO is used to determine whether c is a lower- or 
uppercase letter. If it is, sum is incremented by the value of c. As we will see in Section 
3.3, “Characters and the Data Type char,” on page 111, a character in C has the integer 
value corresponding to its ASCII encoding. For example, ' a' has value 97, ' b' has value 
98, and so forth. Finally, the variable i is incremented at the end of the for loop. The 
for loop is executed repeatedly until a newline character is received. 

■ name[i] = '\0’; 

After the for loop is finished, the null character \0 is assigned to the element name [i ]. 
By convention, all strings end with a null character. Functions that process strings, such 
as pri ntf (), use the null character \0 as an end-of-string sentinel. We now can think of 
the array name in memory as 
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ole 



0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 99 

■ printf("\n%s%s%s\n%s", 

"Nice to meet you ", name, 

"Your name spelled backwards is "); 

Notice that the format %s is used to print the character array name. The elements of the 
array are printed one after another until the end-of-string sentinel \0 is encountered. 

■ for (— i; i >= 0; —i) 

putchar(name[i]); 

If we assume that A1 ice B. Carol e followed by a carriage return was typed in, then i 
has value 15 at the beginning of this for loop. (Do not forget to count from 0, not 1.) 
After i has been decremented, the subscript corresponds to the last character of the 
name that was typed in. Thus, the effect of this for loop is to print the name on the 
screen backwards. 

■ printf("\n%s%d%s\n\n%s\n", 

"and the letters in your name sum to ", sum, 

"Have a nice day!"); 

We print the sum of the letters in the name typed in by the user, and then we print a 
final message. After the sum of the letters in the name is printed, a period is printed. 
Two newlines are used to create a blank line before the final message is printed. Notice 
that this pri ntf Q style allows us to easily visualize what is to appear on the screen. 


Pointers 


A pointer is an address of an object in memory. Because an array name is itself a 
pointer, the uses of arrays and pointers are intimately related. The following program is 
designed to illustrate some of these relationships: 
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In file abc.c 

#include <stdio.h> 
#include <string.h> 

#define MAXSTRING 100 

int main(void) 

{ 


char 


'a', *p, s[MAXSTRING]; 


p, *p + 1, *p + 2) 


p = &c; 

printf("%c%c%c ", *p, *p + 1, *p + 2); 

strcpy(s, "ABC"); 

p^intf("%s %c%c%s\n", s, *s + 6, *s + 7, s + 1); 

strcpy(s, "she sells sea shells by the seashore"); 
p — s + 14 \ 

for ( ; *p != '\0'; ++p) { 
if (* p == 'e’) 

*p = ’E'; 

if (*p == ' ') 

*p = *\n'; 

} 

printf("%s\n", s); 
return 0; 


The output of tliis program is 

abc ABC CHBC 

she sells sea shElls 

by 

thE 

sEashorE 
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Dissection of the abc Program 


■ #include <string.h> 

The standard library contains many string-handling functions. (See Section 6.11, 
“String-Handling Functions in the Standard Library,” on page 272.) The standard header 
file string.h contains the function prototypes for these functions. In this program we 
will use strcpyQ to copy a string. 
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■ char c = 'a', *p, s[MAXSTRING]; 

The variable c is of type char. It is initialized with the value 1 a'. The variable p is of 
tyP e Pointer to char. The string s has size MAXSTRING. 

■ p = &c; 

The symbol & is the address operator. The value of the expression &c is the address in 

memory of the variable c. The address of c is assigned to p. We now think of p as point¬ 
ing to c, 

h printf("%c%c%c ", *p, *p + i f *p + 2); 

The format %c is used to print the value of an expression as a character. The symbol * 
is the dereferencing, or indirection, operator. The expression *p has the value of what¬ 
ever p is pointing to. Because p is pointing to c and c has the value ' a', this is the value 
of the expression *p and an a is printed. The value of the expression *p + 1 is one 
more than the value of ,r p, and this causes a b to be printed. The value of the expression 
• f p + 2 is two more than the value of *p, and this causes a c to be printed, 

■ "ABC" 

A string constant is stored in memory as an array of characters, the last of which is the 
nuil character \0. Thus, the size of the string constant "ABC" is 4, not 3. Even the null 
string "" contains one character, namely \0. It is important to realize that string con¬ 
stants are of type array of char. An array name by itself is treated as a pointer, and this 
is true of string constants as well. 

B strcpy(s, "ABC"); 

The function strcpyQ takes two arguments, both of type pointer to char, which we 
can think of as strings. The string pointed to by its second argument is copied into 
memoiy beginning at the location pointed to by its first argument. All characters up to 
and including a null character are copied. The effect is to copy one string into another. 
It is the responsibility of the programmer to ensure that the first argument points to 

enough space to hold all the characters being copied. After this statement has been exe¬ 
cuted, we can think of s in memory as 



0 1 2 3 4 99 
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g pri ntf("%s %c%c%s\n", s, *s + 6, *s + 7 , s + 1) ; 

The array name s by itself is a pointer. We can think of s as pointing to s [0], or we can 
think of s as being the base address of the array, which is the address of s [0]. Printing 
s in the format of a string causes ABC to be printed. The expression *s has the value of 
what s is pointing to, which is s [0]. This is the character A. Because six letters more 
than A is G and seven letters more than A is H, the expressions *s + 6 and *s + 7, 
printed in the format of a character, cause G and H to be printed, respectively. The 
expression s + 1 is an example of pointer arithmetic. The value of the expression is a 
pointer that points to s [1], the next character in the array. Thus, s + 1 printed in the 
format of a string causes BC to be printed. 

■ strcpyfs, "she sells sea shells by the seashore"); 

This copies a new string into s. Whatever was in s before gets overwritten. 

■ p = s + 14; 

The pointer value s + 14 is assigned to p. An equivalent statement is 


p = &s[14]; 

If you count carefully, you will see that p now points to the first letter in the word 
“she! 1 s.” Note carefully that even though s is a pointer, it is not a pointer variable, but 
rather a pointer constant. A statement such as 


is legal because p is a pointer variable, but the statement 


would result in a syntax error. Although the value of what s points to may be changed, 
the value of s itself may not be changed. 


for ( ; *p != ’\0’; ++p) { 
if (*p == ’e') 

P = t i 
if (*p == ' ’) 

*p = '\n'; 

} 


As long as the value of what p is pointing to is not equal to the null character, the body 
of the for loop is executed. If the value of what p is pointing to is equal to ' e', then 
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that value in memory is changed to ’ E'. If the value of what p is pointing to is equal to 
1 1 , then that value in memory is changed to 1 \n 1 . The variable p is incremented at the 
end of the for loop. This causes p to point to the next character in the string. 

■ printf("%s\n", s); 

The variable s is printed in the format of a string followed by a newline character. 
Because the previous for loop changed the values of some of the elements of s, the to - 

lowing is printed: 

■ she sells sea shElls 
by 

thE 

sEashorE 

In C, arrays, strings, and pointers are closely related. To illustrate consider the decla 
ration 

char *p, s[100]; 

This creates the identifier p as a pointer to char and the identifier s as an array of 100 
elements of type char. Because an array name by itself is a pointer, both p and s are 
pointers to char. However, p is a variable pointer, whereas s is a constant pointer that 
points to s [0]. Note that the expression ++p can be used to increment p, but because s 
is a constant pointer, the expression ++s is wrong. The value of s cannot be changed. Of 
fundamental importance is the fact that the two expressions 

s[i] and *(s + i) 

are equivalent. The expression s [i ] has the value of the fth element of the array (count¬ 
ing from 0), whereas *(s + i) is the dereferencing of the expression s + i, a pointer 
expression that points i character positions past s. In a similar f ashion, the two expres¬ 
sions 

p[i] and *(p + i) 
are equivalent. 
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1.9 Files 

Files are easy to use in C. To open the file named my_file, the following code can be 
used: 

In file read Jt.c 

#include <stdio.h> 

int main(void) 

{ 

int c; 

FILE *ifp; 

ifp = fopen("my_file", "r"); 


The second line in the body of mai n () declares i fp (which stands for infile pointer) to 
be a pointer to FILE. The function fopenQ is in the standard library, and its function 
prototype is in stdio.h. The type FILE is defined in stdio.h as a particular structure. To 
use the construct, a user need not know any details about it. However, the header file 
must be made available by means of a #i ncl ude directive before any reference to FILE 
is made. The function fopenO lakes two strings as arguments, and it returns a pointer 
to FILE. The first argument is the name of the file, and the second argument is the 
mode in which the file is to be opened. 

! Three modes for a file 


ri pir 


"a" 


for read 
for write 
for append 


When a file is opened for writing and it does not exist, i t is created. If it already 
exists, its contents are destroyed and the writing starts at the beginning of the file. If 
for some reason a file cannot be accessed, the pointer value NULL is returned by 
fopenO. After a file has been opened, all references to it are via its file pointer. Upon 
the completion of a program, the C system automatically closes all open files. All C sys¬ 
tems put a limit on the number of files that can be open simultaneously. Typically, this 
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limit is either 20 or 64. When using many files, the programmer should explicitly close 
any files not currently in use. The library function fcloseO is used to close files. 

Let us now examine the use of files. With text, it is easy to make a frequency analysis 
of the occurrence of the characters and words making up the text. Such analyses have 
proven useful in many disciplines, from the study of hieroglyphics to the study of 
Shakespeare. To keep things simple, we will write a program that counts the occur¬ 
rences of just uppercase letters. Among our files is one named chapter 1, which is the 
current version of this chapter. We will write a program called cntjetters that wall open 
files for reading and writing to do the analysis on this chapter. We give the command 

cntjetters chapter1 datal 

to do this. However, before we present our program, let us describe how command line 
arguments can be accessed from within a program. The C language provides a connec¬ 
tion to the arguments on the command line. Typically, to use the connection one would 
code 

#include <stdio.h> 

int main (int argc, char *argv[]) 

{ 


Up until now we have always invoked mai n () as a function with no arguments. In fact, 
it is a function that can have arguments. The parameter argc stands for argument 
count. Its value is the number of arguments in the command line that was used to exe¬ 
cute the program. The parameter argv stands for argument vector. It is an array of 
pointers to char. Such an array can be thought of as an array of strings. The successive 
elements of the array point to successive words in the command line that was used to 
execute the program. Thus, argv[0] is a pointer to the name of the command itself. As 
an example of how this facility is used, suppose that we have written our program and 
have put the executable code in the file cntjetters. The intent of the command line 

cntjetters chapter1 datal 

is to invoke the program cntjetters with the two file names chapterl and datal as com¬ 
mand line arguments. The program should read file chapterl and write to file datal. If 
we give a different command, say 


cntjetters chapterZ data2 


then the program should read file chapter2 and write to file data2. In our program the 
three words on the command line will be accessible through the three pointers 
argv[0], argv[l], and argv[2]. 

Here is our program: 

In file cnt Jetters.c 

/* Count uppercase letters in a file. */ 

#include <stdio.h> 

#include <stdlib.h> 

int mainfint argc, char *argv[]) 

{ 

int c, i, letter[26]; 
p t i p * -j f n * 

1 (nnMM ▼ I S | 

if (argc != 3) { 

printf("\n%s%s%s\n\n%s\n%s\n\n", 

"Usage: ", argv[0], " infile outfile", 

"The uppercase letters in infile will be counted.", 

"The results will be written in outfile."); 
exit(l)■ 

V- A S U ^ X J f 

} 

ifp = fopen(argv[l], "r"); 
ofp = fopen(argv[2], "w"); 

for (i =0; i <26; ++i) /* initialize array to zero */ 

1etter[i] = 0; 

while ((c = getc(ifp)) != EOF) 

if (c >= 'A' && c <= 'Z') /* find uppercase letters */ 

++letterfc - 'A'l■ 

III S— L V- V- I L v- I \ _J | 

for (i = 0; i < 26; ++i) { /* print results */ 

if (i % 6 == 0) 

putc('\n\ ofp); 

fprintf(ofp, "%c:%5d ", 'A' + i, letter[i]); 

} 

putc('\n', ofp); 
return 0; 


After we have given the command 


cntjetters chapterl datal 
this is what we find in the file datal; 
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A : 

223 

D * 

D « 

62 

c * 

193 

D; 

31 

E : 

120 

r . 

89 

G: 

21 

H: 

48 

T % 

*mt* it 

304 

3: 

1 

K: 

7 

L : 

50 

M: 

67 

N: 

77 

0: 

91 

P: 


Q: 

19 

0 i 
# 

57 

S: 

196 

T: 

439 

U: 

33 

V: 

4 

W: 

68 

X: 

29 

Y: 

7 

y , 

w-W % 

18 










Observe that the frequency of the letters in chapterl is not what one expects in ordi 
nary text. 



Dissection of the cntjetters Program 

a int c, i, letter[26]; 

FILE «ifp, ,c ofp; 

The array letter will be used to count the occurrences of the uppercase letters. The 
variables i fp and ofp are of type pointer to FILE. We often use the identifiers i fp and 
of p, which stand for infile pointer and outfile pointer, respectively. 

■ if (argc != 3) { 

printf("\n%s%s%s\n\n%s\n%s\n\n", 

"Usage: ", argv[0], " infile outfile", 

"The uppercase letters in infile will be counted.", 

"The results will be written in outfile."); 

AVI ^ * 

fc: A I L J f 


If the number of words on the command line is not three, then the program is being 
used incorrectly. This causes a message to be printed and the program to be exited. 
Suppose the following command line is typed: 

cntjetters chapterl abc abc 

Because the line contains four words, argc will have value 4, and this Mil cause the fol¬ 
lowing message to appear on the screen: 

Usage: cnt„letters infile outfile 

The uppercase letters in infile will be counted. 

The results will be written in outfile. 



ifp = fopen(argv[l], "r"); 
ofp = fopen(argv[2], "w"); 
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If we assume that we have typed the command line 
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cntjetters chapterl datal 


to execute this program, then argv[0] points to the string "cnt_letters", argv[l] 
points to the string "chapterl", and argv[2] points to the string "datal". The C sys¬ 
tem does this automatically. Thus, the file chapterl is opened for reading with file 
pointer i fp referring to it, and the file datal is opened for writing with file pointer ofp 
referring to it. 


for (i =0; i <26; ++i) 
letter[i] = 0; 


/* initialize array to zero */ 


In ANSI C, automatically allocated local array elements need not be initialized to zero. 
To be sure, the programmer must do it. 

■ (c = getc(ifp)) != EOF 

The function getc() is a macro defined in stdio.h. It is similar to getcharO except 
that it takes as an argument a pointer to FILE. When getc(ifp) is invoked, it gels the 
next character from the file pointed to by i f p. The identifier EOF stands for end-of-file. 
It is a symbolic constant defined in stdio.h , typically by the line 

#define EOF (-1) 


The value EOF is returned by getc() when there are no more characters in the file. In C, 
characters have the integer value corresponding to their ASCII encoding. (See Section 
3.3, “Characters and the Data Type char,” on page 111.) For example, 'a' has value 97, 
' b' has value 98, and so forth, A char is stored in 1 byte, and an i nt is typically stored 
in either 2 or 4 bytes. Thus, a char can be considered a small integer type. Conversely, 
an i nt can be considered a large character type. In particular, an i nt can hold all the 
values of a char and other values as well, such as EOF, which is not an ordinary charac¬ 
ter value. The variable c was declared to be an i nt rather than a char because it even¬ 
tually would be assigned the value EOF. 

■ while ((c = getc(ifp)) != EOF) 

if (c >= 'A' && c <= 'Z’) /* find uppercase letters */ 

++letter[c - 'A']; 


A character is read and assigned to c. If the value of the character is not EOF, then the 
body of the whi 1 e loop is executed. 
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■ c >= 'A' && c <= 'Z' 

The expression c >= 'A' is true if c is greater than or equal to 'A'. Similarly, the 
expression c <= ' Z' is true if c is less than or equal to ' Z'. The symbols && represent 
the logical and operator. An expression of the form 

exprl && expr2 

is true if and only if both exprl and expr2 are true. Because of operator precedence 

c >= 'A' && c <= is equivalent to (c >= 'A') && (c <= 'Z') 

Thus, the expression c >= ' A' && c <= ' Z' is true if and only if c has the value of an 
uppercase letter. 

1 ++1etter[c - 'A']; 

If c has the value 'A', then c - 'A' has the value 0. Thus, the array element 1 et¬ 
ter [0] gets incremented when c has the value 'A'. Similarly, if c has the value ‘B\ 
then c - 'A' has the value 1. Thus, the array element 1 etter[1] gets incremented 
when c has the value ' B'. In this way a count of the uppercase letters is kept in the ele¬ 
ments of the array letter with letter [0] corresponding to the letter A, 1 etter[1] 
corresponding to the letter B, and so forth. 

■ for (i = 0; i < 26; ++i) { /* print results */ 

if (i % 6 == 0) 
putc(’\n', ofp); 

The symbol % is the modulus operator. An expression such as a % b yields the remain¬ 
der of a divided by b. For example, 5 % 3 has the value 2. In the body of the for loop 
we have used the expression i % 6, which has the value 0 whenever the value of i is a 
multiple of 6. Because of operator precedence, the expression 

i % 6 == 0 is equivalent to (i % 6) == 0 

Thus, the expression i % 6 == 0 is true every sixth time through the loop; at these 
times a newline character is printed. If you look at the output of the program, you will 
see that it is printed in six columns. The macro putc() is defined in stdio.h. It is similar 
to putchar O except that its second argument is a pointer to FILE. The value of its first 
argument is written to the indicated file in the format of a character. 

■ fprintf(ofp, "%c:%5d ", 'A' + i, 1etter[i]); 

The function fprintfQ is similar to printfQ except that it takes as its first argu¬ 
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ment a pointer to FILE. When the function is invoked, it writes to the indicated file 
rather than to the screen. Observe that 'A' + i is being printed in the format of a char¬ 
acter. When i is 0, the expression 'A' + i has the value 'A', causing the letter A to be 
printed; when i is 1, the expression 'A' +1 has the value 1 B ', causing the letter B to 
be printed; and so forth. 


Although we did not do so, we could have explicitly closed the open files just before we 
exited from mai n(). Instead, we relied on the C system to close the files. We would use 
the following code to explicitly close the files: 

fclose(ifp); 
fclose(ofp); 


1.10 Operating System Considerations 

In this section, we discuss a number of topics that are system-dependent. We begin 
with the mechanics of writing and running a p ' cim 


Writing and Running a C Program 

The precise steps that have to be followed to create a file containing C code and to com¬ 
pile and execute it depend on three things: the operating system, the text editor, and 
the compiler. However, in all cases the general procedure is the same. We first describe 
in some detail how it is done in a UNIX environment. Then we discuss how it is done in 
an MS-DOS environment. 

In the discussion that follows, we will be using the cc command to invoke the C com¬ 
piler. In reality, however, the command depends on the compiler that is being used. For 
example, if we were using the command line version of the Turbo C compiler from Bor¬ 
land, then we would use the command fee instead of cc. (For a list of C compilers, see 
the table in Section 11.13, “The C Compiler,” on page S22.) 
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1 Using an editor, create a text file, say pgm.c, that contains a C program. The 
name of the file must end with .c, indicating that the file contains C source code. 
For example, to use the vi editor on a UNIX system, we would give the command 

vi pgm.c 

To use an editor, the programmer must know the appropriate commands for 
inserting and modifying text. 

2 Compile the program. This can be done with the command 

cc pgm.c 

The cc command invokes in turn the preprocessor, the compiler, and the loader. 
The preprocessor modifies a copy of the source code according to the prepro¬ 
cessing directives and produces what is called a translation unit. The compiler 
translates the translation unit into object code. If there are errors, then the pro¬ 
grammer must start again at step 1 with the editing of the source file. Errors that 
occur at this stage are called syntax errors or compile-time errors. If there are no 
errors, then the loader uses the object code produced by the compiler, along with 
object code obtained from various libraries provided by the system, to create the 
executable file a.out. The program is now ready to be executed. 

3 Execute the program. This is done with the command 

a.out 

Typically, the program will complete execution, and a system prompt will reap¬ 
pear on the screen. Any errors that occur during execution arc called, run-time 
errors. If for some reason the program needs to be changed, the programmer 
must start again at step 1. 

If we compile a different program, then the file a.out will be overwritten, and its pre¬ 
vious contents lost. If the contents of the executable file a.out are to be saved, then the 
file must be moved, or renamed. Suppose that we give the command 


cc sea.c 


This causes executable code to be written automatically into a.out. To save this file, we 
can give the command 


mv a.out sea 
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IS This causes a.out to be moved to sea. Now the program can be executed by giving the 
•command 


IT sea 

■ In UNIX, it is common practice to give the executable file the same name as the corre- 
sponding source file, except to drop the .c suffix. If we wish, we can use the -o option to 
|direct the output of the cc command. For example, the command 

t cc -o sea sea.c 

causes the executable output from cc to be written directly into sea, leaving intact what¬ 
ever is in a.out. 

Different kinds of errors can occur in a program. Syntax errors are caught by the 
compiler, whereas run-time errors manifest themselves only during program execution. 
For example, if an attempt to divide by zero is encoded into a program, a run-time error 
may occur when the program is executed. (See exercise 5, on page 61, and exercise 6, on 
page 61.) Usually, an error message produced by a run-time error is not very helpful in 
finding the trouble. 

Let us now consider an MS-DOS environment. Here, some other text editor would 
most likely be used. Some C systems, such as Turbo C, have both a command line envi¬ 
ronment and an integrated environment. The integrated environment includes both the 
text editor and the compiler. (Consult Turbo C manuals for details.) In both MS-DOS and 
UNIX, the command that invokes the C compiler depends on which C compiler is being 
used. In MS-DOS, the executable output produced by a C compiler is written to a file 
having the same name as the source file, but with the extension .exe instead of ,c. Sup¬ 
pose, for example, that we are using the command line environment in Turbo C. If we 
give the command 

tcc sea.c 

then the executable code will be written to sea. exe. To execute the program, we give the 
command 

sea.exe or equivalently sea 


To invoke the program, we do not need to type the .exe extension. If we wish to rename 
this file, we can use the rename command. 
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Interrupting a Program 

When running a program, the user may want to interrupt, or kill, the program. For 
example, the program may be in an infinite loop. (In an interactive environment it is not 
necessarily wrong to use an infinite loop in a program.) Throughout this text we 
assume that the user knows how to interrupt a program. In MS-DOS and in UNIX, a con- 
trol-c is commonly used to effect an interrupt. On some systems a special key, such as 
delete or rubout is used. Make sure that you know how to interrupt a program on your 
system. 


Typing an End-of-file Signal 

When a program is taking its input from the keyboard, it may be necessary to generate 
an end-of-file signal for the program to work properly. In UNIX, a carriage return fol¬ 
lowed by a control-d is the typical way to effect an end-of-file signal. (See exercise 26, 
on page 68, for further discussion.) 

Redirection of the Input and the Output 

Many operating systems, including MS-DOS and UNIX, can redirect the input and the 
output. To understand how this works, first consider the UNIX command 

Is 

This command causes a list of files and directories to be written to the screen. (The 
comparable command in MS-DOS is dir.) Now consider the command 

Is > tmp 

The symbol > causes the operating system to redirect the output of the command to 
the file tmp. What was written to the screen before is now written to the file tmp. 

Our next program is called dbLout. It can be used with redirection of both the input 
and the output. The program reads characters from the standard input file, which is 
normally connected to the keyboard, and writes each character twice to the standard 
output file, which is normally connected to the screen. 
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In file dbLout.c 

ft #include <stdio.h> 
int main(void) 

l- { 

char c; 

while (scanf("%c", &c) 
printf("%c", c); 
printf( M %c", c); 

return 0; 

T } 


- 1) { 


i*t£f 


If we compile the program and pul the executable code in the file dbLout then using 
redirection, we con invoice the program in four Wctys* 






dbLout 
dbLout < 
dbLout > 
dbLout < 


infile 
outfile 
infile > 


outfile 
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Some commands are not meant to be used with redirection. For example the Is mm- 
SctrST^ characters from the keyboard. Therefore, it makes no sense to redi¬ 
nothing to redirect. command > because it does not take keyboard input, there is 
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Summary 

1 Programming is the art of communicating algorithms to computers. An algorithm, is 
a computational procedure whose steps are completely specified and elementary. 

2 The C system provides a standard library of functions that can be used by the pro¬ 
grammer. Two functions from the library are printf () and scanfC). They are 
used for output and input, respectively. The function printf () can print out 
explicit text and can use conversion specifications that begin with the character % 
to print the values of the arguments that follow the control string. The use of 
scanf () is somewhat analogous, but conversion specifications in the control string 
are matched with the other arguments, all of which must be addresses (pointers). 

3 The C compiler has a preprocessor built into it. Lines that begin with a # are prepro¬ 
cessing directives. A #defi ne directive can be used to define symbolic constants. A 
#i ncl ude directive can be used to copy the contents of a file into the code. 

4 Statements are ordinarily executed sequentially. Special statements such as i f, 
i f-el se, for, and while statements can alter the sequential flow of control during 
execution of a program. 

5 A program consists of one or more functions written in one or more files. Execution 
begins with the function mai n(). Other functions may be called from within 
mai n() and from within each other. 

6 A function definition is of the form 

type function_name(parameter type list) 

{ 

declarations 

statements 

} 

A function definition consists of two parts, a header and a body. The header con¬ 
sists of the code before the first left brace, and the body consists of the declara¬ 
tions and statements between the braces. 


7 In a function definition all declarations must occur before any statements. All vari¬ 
ables must be declared. Compound statements are surrounded by the braces 
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Exercises 

1 On the screen write the words 

she sells sea shells by the seashore 
(a) all on one line, (b) on three lines, (c) inside a box. 

2 Use a hand calculator to verify that the output of the marathon program is correct. 
Create another version of the program by changing the floating constant 1760.0 to 
an integer constant 1760. Compile and execute the program and notice that the out 
put is not the same as before. This is because integer division discards any irac 

tional part, 

3 Write a version of the marathon program, in Section 1.3, Variables, Expressions, 
and Assignment,” on page 11, in which all constants and variables are of type dou» 
bl e. Is the output of the program exactly the same as that of the original program? 

4 Take one of your working programs and alter it by deleting the keyword voi d in the 
line 

int main (void) 

When you compile your program, does your compiler complain? Probably not. (See 
Section 5.3, “Function Prototypes,” on page 202, for further discussion.) Next, 
remove the keyword voi d and remove the following line from the body of inai n (). 

return 0; 

When you compile the program, does your compiler complain? Ihis time it should. 
If your compiler does not complain, learn how to set a higher warning level for your 
compiler. Generally speaking, programmers should always use the highest warning 
level possible. One of the principal rules of programming is keep yout compilei 
happy, but not at the expense of turning off all the warnings. Programmers should 
rework their code repeatedly until all the warnings have vanished. 


'ogram compiles without any error messages. Run the pr 
)f integer division by zero. On most systems this progra 
rror. If tins happens on your system, try to rewrite the pr 
jle y, but keep the error in the program. That is, divide t 
happens? 

2 for logically infinite floating values. Modify the progra: 
/ changing int to double and. in the orintfO statemer 
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9 The following program writes a large letter I on the screen; 

#include <stdio.h> 

#define BOTTOM_SPACE "\n\n\n\n\n" 

#define HEIGHT 17 

#define OFFSET " " /* 17 blanks */ 

#define TOP_SPACE "\n\n\n\n\n" 

int main (void) 

{ 

int i; 

pri ntf(TOP_SPACE); 
printf(OFFSET "IIIIIII\n"); 
for (i =0; i < HEIGHT; ++i) 
printf(OFFSET " III\n"); 
printf(OFFSET "IIIIIII\n"); 
pri ntf(BOTTOM_SPACE); 
return 0; 

} 

Compile and run this program so that you understand its effect. Write a similar 
program that prints a large letter C on the screen. 

10 Take a working program and omit each line in turn and run it through the compiler. 
Record the error messages each such deletion causes. As an example, consider the 
following code in the file nonsense.c: 

#include <stdio.h> 

/* forgot main */ 

{ 

printf("nonsense\n"); 

} 

11 Write a program that asks interactively for your name and age and responds with 

Hello name, next year you will be nexLage. 
where next_age is age + 1. 
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12 Write a program that neatly prints a table of powers. The first few lines of the table 


might look like this: 

:;::: A TABLE OF POWERS 
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Integer Square 3rd power 4th power 5th power 


1 

1 

1 

i 

1 

2 

4 

8 

16 

32 

3 9 

A for loop has the form 

for ( exprl ; exprZ ; expr3') 
statement 

27 

81 

243 




If all three expressions are present, then this is equivalent to 
exprl ; 

while (expr2) { 
statement 
expr3 ; 

} 

Why is there no semicolon following statement'? Of course, it may well happen that 
statement itself contains a semicolon at the end, but it does not have to. In C, a 
compound statement consists of braces surrounding zero or more other state¬ 
ments, and a compound statement is itself a statement. Thus, both { } and { ; ; 
; } are statements. Try the following code: 

int i; 


for (i 

= 0; i < 3 

; ++i) 


{ } 
for (i 

= 0; i < 3 

; ++i) 

/* no semicolon V 


{ ; ; ; } 


/* three semicolons 
but none after the statement */ 


Is your compiler happy with this? (It should be.) Compilers care about legality. If 
what you write is legal but otherwise nonsense, your compiler will be happy. 

14 The standard header files supplied by the C system can be found in one or more 
system-dependent directories. For example, on UNIX systems these header files 
might be in /usr/inchide. On Turbo C systems they might be in \turboc\include or 
\tc\include or \bc\include. Find the location of the standard header file stdio.h on 
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your system. Read this file and find the line that pertains to pri ntf (). The line will 
look something like 

int printf(const char *format, 

This line is an example of a function prototype. Function prototypes tell the com¬ 
piler the number and type of arguments that are expected to be passed to the func¬ 
tion and the type of the value that is returned by the function. As we will see in 
later chapters, strings are of type “pointer to char,” which is specified by char *. 
The identifier format is provided only for its mnemonic value to the programmer. 
The compiler disregards it. The function prototype for pri ntf C) could just as well 
have been written 


int printf(const char 


.. .); 


The keyword const tells the compiler that the string that gets passed as an argu¬ 
ment should not be changed. The ellipses .. . indicate to the compiler that the 
number and type of the remaining arguments vary. The pri ntf () function returns 
as an i nt the number of characters transmitted, or a negative value if an error 
occurs. Recall that the first program in this chapter in Section 1.2, “Program Out¬ 
put,” on page 6, prints the phrase “from sea to shining C” on the screen. Rewrite the 
program by replacing the #i ncl ude line with the function prototype for pri ntf Q 
given above. Caution: You can try to use verbatim the line that you found in stdio.h, 
but your program may fail. (See Chapter 8, “The Preprocessor.”) 

15 (Suggested to us by Donald Knuth at Stanford University.) In the running_sum pro¬ 
gram in Section 1.6, “Flow of Control,” on page 26, we first computed a sum and 
then divided by the number of summands to compute an average, the following 
program illustrates a better way to compute the average: 

/* Compute a better average. V 
#include <stdio.h> 
int main(void) 


int 

double 
double 
double 
double 


X # 

J 

avg =0.0; 
navg; 

sum = 0.0; 


/* a better average */ 
/* a naive average */ 


printf("%5s%17s%17s%17s\n%5s%17s%17s%17s\n\n" 
"Count". "Item". "Averaqe", "Naive avg", 
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for (i = 1; scanf("%lf", &x) == 1; ++i) { 
avg += (x - avg) / i; 
sum += x; 
navg = sum / i; 

printf("%5d%17e%17e%17e\n", i, x, avg, navg) 

} 

return 0; 


Run this program so that you understand its effects. Note that the better algorithm 
for computing the average is embodied in the line 

avg += (x - avg) / i; 

Explain why this algorithm does, in fact, compute the running average. Hint: Do 
some simple hand calculations first. 

In the previous exercise we used the algorithm suggested to us by Donald Knuth to 
write a program that computes running averages. In this exercise we want to use 
that program to see what happens when sum gets too large to be represented in the 
machine. (See Section 3.6, “The Floating Types,” on page 119, for details about the 
values a double can hold.) Create a file, say data , and put the following numbers 
into it: 

le308 1 le308 1 le308 

Run the program, redirecting the input so that the numbers in your file data get 
read in. Do you see the advantage of the better algorithm? 

(Advanced) In this exercise you are to continue the work you did in the previous 
exercise. If you run the better_average program taking the input from a file that 
contains some ordinary numbers, then the average and the naive average seem to 
be identical. Find a situation where this is not the case. That is, demonstrate experi¬ 
mentally that the better average really is better, even when sum does not overflow. 

Experiment with the type qualifier const. How does your compiler treat the follow¬ 
ing code? 


const int 


a = 0: 




a = 333 ; 

pri ntf C'^dXn 11 , a) 
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19 Put the following lines into a program and run it so that you understand its effects: 

int al, a2, a3, cnt; 

printfC'Input three integers: "); 

cnt = scanf("%d%d%d", &al, &a2, &a3); 

printf("Number of successful conversions: %d\n", cnt); 

What happens if you type the letter x when prompted by your program? What num¬ 
bers can be printed by your program? Hint: If scanf () encounters an end-of-file 
mark before any conversions have occurred, then the value EOF is returned, where 
EOF is defined in stdio.h as a symbolic constant, typically with the value -1. You 
should be able to get your program to print this number. 

20 In ANSI C the printf () function returns as an int the number of characters 
printed. To see how this works, write a program containing the following lines: 

int cnt; 

cnt = printf("abc abc"); 

printf("\nNo. of characters printed: %d\n", cnt); 

What gets printed if the control string is replaced by 

"abc\nabc\n" or ”abc\0abc\0" 

21 In the previous exercise you were able to get different numbers printed, depending 
on the input you provided. Put the following lines into a program: 

char cl, c2, c3; 
int cnt; 

printfC'Input three characters: "); 

cnt = scanf("%c%c%c", &cl, &c2, &c3); 

printf("Number of successful conversions: %d\n", cnt); 

By varying the input, what numbers can you cause to be printed? Hint: The numbers 
printed by the program you wrote for exercise 19, on page 66, can be printed here, 
but you have to work much harder to do it. 


Y Exercises 


22 Use the ideas presented in the nice_day program in Section 1.8, “Arrays, Strii 
i and Pointers,” on page 39, to write a program that counts the total number of 

ters. Use redirection to test your program. If infile is a file containing text, then 
command 

nletters < infile 

should cause something like 

Number of letters: 179 

to be printed on the screen. 

23 In the abc program in Section 1.8, “Arrays, Strings, and Pointers,” on page 43 
used the loop 
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#include <stdio.h> 

#include <string.h> 

#define MAXSTRING 100 

void prn„string(char *); 

int main(void) 

{ 

char si[MAXSTRING], s2[MAXSTRING]; 

strcbyj(sl, "Mary, Mary, quite contrary,\n T ') ; 
strcpy(s2, "how does your garden grow?\n M ); 
prn_string(sl); 
prn_string(s2); 

strcat(sl, s2); /* concatenate the strings */ 

prn_string(sl) ; 
return 0; 

} 


26 Redirection, like many new ideas, is best understood with experimentation. In Sec¬ 
tion 1.10, “Operating System Considerations,” on page 57, we presented the dbLout 
program. Create a file, say my^file, that contains a few lines of text. Try the follow¬ 
ing commands so that you understand the effects of using redirection with dbLout. 

dbLout < my_file 
dbLout < my_file > tmp 

The following command is of special interest: 
dbLout > tmp 

This command causes dbLout to take its input from the keyboard and to write its 
output in the file tmp, provided that you effect an end-of-file signal when you are 
finished. What happens if instead of typing a carriage return followed by a con- 
trol-d, you type a control-c to kill the program? Does anything at all get written into 
tmp? 


Lexical Elements, Operators, 
and the C System 


In this chapter, we explain the lexical elements of the C programming language. C is a 
language. Like other languages, it has an alphabet and rules for putting together words 
and punctuation to make correct, or legal, programs. These rules are the syntax of the 
language. The program that checks on the legality of C code is called the compiler. If 
there is an error, the compiler will print an error message and stop. If there are no 
errors, then the source code is legal, and the compiler translates it into object code, 
which in turn gets used by the loader to produce an executable file. 

When the compiler is invoked, the preprocessor does its work first. For that reason 
we can think of the preprocessor as being built into the compiler. On some systems, 
this is actually the case, whereas on others the preprocessor is separate. This is not of 
concern to us in this chapter. We have to be aware, however, that we can get error mes¬ 
sages from the preprocessor as well as from the compiler. (See exercise 30, on page 
106.) Throughout this chapter, we use the term compiler in the sense that, conceptually, 
the preprocessor is built into the compiler. 

A C program is a sequence of characters that will be converted by a C compiler to 
object code, which in turn gets converted to a target language on a particular machine. 
On most systems, the target language will be a form of machine language that can be 
run or interpreted. For this to happen, the program must be syntactically correct. The 
compiler first collects the characters of the program into tokens , which can be thought 
of as the basic vocabulary of the language. 

In ANSI C, there are six lands of tokens: keywords, identifiers, constants, string con¬ 
stants, operators, and punctuators. The compiler checks that the tokens can be formed 
into legal strings according to the syntax of the language. Most compilers are very pre¬ 
cise in their requirements. Unlike human readers of English, who are able to understand 
the meaning of a sentence with an extra punctuation mark or a misspelled word, a C 
compiler will fail to provide a translation of a syntactically incorrect program, no mat¬ 
ter how trivial the error. Hence, the programmer must learn to be precise in writing 
code. 
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The programmer should strive to write understandable code. A key part of doing this 
is producing well-commented code with meaningful identifier names. In this chapter we 
illustrate these important concepts. 



Program 


The compilation process 


Group characters 

_ 

into tokens 

. 


Translate tokens 
to target code 


2.1 Characters and Lexical Elements 

A C program is first constructed by the programmer as a sequence of characters. 
Among the characters that can be used in a program are the following: 


Characters that can be used in a program 

lowercase letters 

a 

b 

c 

z 


uppercase letters 

A 

B 

C * * * 

z 


digits 

0 

1 

2 3 

4 5 

6 7 8 9 

other characters 

■f* 


/ = C 

) { 1 

[]<>'" 


! 

■* 

# 

% & _ 

1 A ~ 

\ * * ? 

\ • J J * 

white space characters 

blank, 

newline , 

tab, etc. 



1 nx nid i iu.vu i uj 

{ 

int a, b, sum; 

printf("Input two integers: "); 
scanf("%d%d", &a, &b); 
sum = a + b; 

printf("%d + %d = %d\n", a, b, sum) 
return 0; 



ft 

ical Dissection of the sum Program 

/* Read in two integers and print their sum. */ 

ments are delimited by /* and */■ The compiler first replaces each commeni 
le blank. Thereafter, the compiler either disregards white space or uses it to 
tokens. 

#include <stdio.h> 

is a preprocessing directive that causes the standard header file stdio.h 
ded. We have included it because it contains the function prototypes for pri n 
:canf (). A function prototype is a land of declaration. The compiler needs 
nototypes to do its work. 

int main(void) 

{ 

int a, b, sum; 

ompiler groups these characters into four kinds of tokens. The function i 
is an identifier, and the parentheses ( ) immediately following mai n is an o 
his idea is confusing at first, because what you see following mai n is (voi d) 
inly the parentheses ( ) themselves that constitute the operator. This ope 
he compiler that main is a function. The characters and are pun 

int is a keyword; a, b, and sum are identifiers. 
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n int a, b, sum; 

The compiler uses the white space between i nt and a to distinguish the two tokens. We 
cannot write 

inta, b, sum; /* wrong: white space is necessary */ 

On the other hand, the white space following a comma is superfluous. We could have 
wri tten 


int a,b,sum; but not int absum; 
The compiler would consider absum to be an identifier. 


m printf("Input two integers: "); 
scant C%d%d" , &a, &b) ; 

The names pri ntf and scant are identifiers, and the parentheses following them tell 
the compiler that they are functions. After the compiler has translated the C code, the 
loader will attempt to create an executable file. If the code lor pri ntf () and scanfQ 
has not been supplied by the programmer, it will be taken from the standard library. A 
programmer would not normally redefine these identifiers. 

m "Input two integers: " 

A series of characters enclosed in double quotes is a string constant. The compiler 
treats this as a single token. The compiler also provides space in memory to store the 
string. 


■ &a, &b 

The character & is the address operator. The compiler treats it as a token. Even though 

the characters & and a are adjacent to each other, the compiler treats each of them as a 

separate token. We could have written 

& a , & b or &a, &b 

<| •» 4 - ■ « I 

- *w. . ,mi. %r. 

&a &b /* the comma punctuator is missing */ 

a&, &b /* & requires its operand to be on the right */ 


; 


fajaB aSSB 
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2,2 T Syntax Rules 


h + b j 



> characters = and + are operators. White space here will be ignored, so we could 
ye written 


mfe- mm 
mm 
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sum=a+b 


not 


sum 


b ; 


a + b; 
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We had written the latter, then each letter on this line would be treated by the com- 
Oer as a separate identifier. Because not all of these identifiers have been declared, the 
ntpiler would complain. 


©few. 

« a— 
P' : 

■ fm 

■ 



The compiler either ignores white space or uses it to separate elements of the lan- 
age. The programmer uses white space to provide more legible code. To the compiler, 
jjogram text is implicitly a single stream of characters, but to the human reader, it is a 
two-dimensional tableau. 






pi2 Syntax Rules 


, 'The syntax of C v\dll be described using a rule system derived from Backus-Naur Form 
(BNF), first used in 1960 to describe ALGOL 60. Although they are not adequate by 
|§l!|$emselves to describe the legal strings of C, in conjunction with some explanatory 
remarks they are a standard form of describing modern high-level languages. 

IlffV A syntactic category will be written in italics and defined by productions, also called 
rewri tin g rules, such as 


j|f digit :: - 0 | 1 | 
This should be read as 


5 | 6 | 7 | 8 | 9 



The syntactic category digit is rewritten as either the symbol 0, the symbol 1,... , or 
the symbol 9. 

The vertical bar separates alternate choices. Symbols not in italics are taken to be termi¬ 
nal symbols of the language to which no further productions are applied. 














Symbols to be used in productions 

i . ...........,.., ...1 

italics 

indicate syntactic categories 

a at ,_ 

i m 

“to be rewritten as” symbol 

i 

1 

vertical bar to separate choices 

{ >1 

choose 1 of the enclosed items 

{ 1 0+ 

repeat the enclosed items 0 or more times 

1 >i + 

repeat the enclosed items 1 or more times 

{ Jopt 

optional items 


Other items are terminal symbols of the language. 

Let us define a category letter .or.digit to mean any lower- or uppercase letter of the 
alphabet or any decimal digit. Here is one way we can do this: 

letter_or.digit ::= letter | digit 

letter : := lowercase.letter \ uppercasejetter 

lowercase.letter ::=a|b|c|... |z 

uppercasejetter :: = A | B | C | ... | Z 

digit ::=0|1|2|3|4|5|6|7|8|9 

Now let us create a category alphanumeric.string to be an arbitrary sequence of letters 
or digits. 

alphanumeric.string :: = { letter.or.digit } 0+ 

Using these productions, we see that strings of one character such as "3" and strings of 
many characters such as "ab777c" as well as the null string "" are all alphanumeric 
strings. Note that in each of our examples double-quote characters w^ere used to delimit 
the alphanumeric string. The double-quote characters themselves are not part of the 
string. 

If we wish to guarantee that a string has at least one character, we must define a new 
syntactic category, such as 

alpha.string_l ::= { letter, or .digit } 1+ 
and if we want strings that start with an uppercase letter, we could define 

u.alpha.string : := uppercasejetter { letter.or.digit}a. 


2,3 T Comments 
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Walpha.string : := uppercasejetter alphanumeric.string 

m i 

fin define a syntactic category called conditional.statement to illustrate the {} opt 
on as follows: 


conditional-statement 


if ( expression ) statement 
{else statement} opt 


iuse expression and statement have not yet been supplied with rewriting rules, this 
lory is not defined completely. Those rewriting rules are complicated, and we are 
ge'ady to present them here. In any case, some examples of this syntactic category 


if Cbig_big_big > 999) 

S huqe = giant + a_lot; /* no else part immediately 
! follows */ 

if (normalized_score >= 65) 
pass =1; 

e i se /* else part associated with preceding if part */ 


Comments 


omments are arbitrary strings of symbols placed between the delimiters /* and */ . 
Comments are not tokens. The compiler changes each comment into a single blank 
character. Thus, comments are not part of the executable program. We have already 
en examples such as 


/*** another comment ***/ 


'it it id 


kit j 


A comment can be written in this fashion 
to set it off from the surrounding code. 


/* a comment */ 


Another example is 
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The following illustrates one of many styles that can be used to highlight comments; 


! * * * * * * * * * * * * * * * * * * * * * * * * * * * * * 

* If you wish, you can * 

* put comments in a box. * 

* * * >Y * * * * * * * * * * * * * * * * * * ft * * * * ft * j 


Comments are used by the programmer as a documentation aid. The aim of docu¬ 
mentation is to explain clearly how the program works and how it is to be used. Some¬ 
times a comment contains an informal argument demonstrating the correctness of the 
program. 

Comments should be written simultaneously with program text. Although some pro¬ 
grammers insert comments as a last step, there are two problems with this approach. 
The first is that once the program is running, the tendency is to either omit or abbrevi¬ 
ate the comments. The second is that ideally the comments should serve as running 
commentary, indicating program structure and contributing to program clarity and cor¬ 
rectness. They cannot do this if they are inserted after the coding is finished. 

In C++, the C style of comment is still valid. But in addition, C++ provides for com¬ 
ments that begin with // and run to the end of the line. 


// This is a comment in C++. 

// 

// This is one common way of writing 
// a comment in C++ that consists 
// of many lines. 

// 

/* 

// This C comment style mimics the 
// previous C++ comment style. 

V 


2.4 V Keywords 



mm 

■I; 


1 / 


II''IS 


Mm i: 

2 A 


Keywords 




Keywords are explicitly reserved words that have a strict meaning as individual tokens 
18IL They cannot be redefined or used in other contexts. 


Keywords 


ssagtif.. 

- - - 




auto 

do 

goto 

signed 

unsigned 

break 

double 

if 

sizeof 

voi d 

case 

el se 

int 

static 

volatile 

char 

enum 

long 

struct 

while 

const 

extern 

register 

switch 


continue 

float 

return 

typedef 


default 

for 

short 

union 



m 


H 


flbme implementations may have additional keywords. These will vary from one imple¬ 
mentation, or system, to another. As an example, here are some of the additional key- 
Ifcds in Turbo C. 


asm 


cdecl 


Additional keywords for Borland C 
far huge interrupt near 


pascal 


I 

IB 

IBH 


Compared to other major languages, C has only a small number of keywords. Ada, for 
example, has 62 keywords. It is a characteristic of C that it does a lot with relatively few 
Special symbols and keywords. 
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.5 Identifiers 


An identifier is a token that is composed of a sequence of letters, digits, and the special 
character which is called an underscore, A letter or underscore must be the first char¬ 
acter of an identifier. In most implementations of C, the lower- and uppercase letters 
are treated as distinct. It is good programming practice to choose identifiers that have 
mnemonic significance so that they contribute to the readability and documentation of 
the program. 

identifier ::= {letter | underscore } x {letter I underscore | digit } 0+ 
underscore : _ 

Some examples of identifiers are 
k 

_id 

iamanidentifier2 

so_am_i 

but not 

not#me /* special character # not allowed */ 

101_south /* must not start with a digit */ 

-plus /* do not mistake - for _ */ 

Identifiers are created to give unique names to objects in a program. Keywords can 
be thought of as identifiers that are reserved to have special meaning. Identifiers such 
as scanf and pri ntf are already known to the C system as input/output functions in 
the standard library. These names would not normally be redefined. The identifier mai n 
is special, in that C programs always begin execution at the function called mai n. 

One major difference among operating systems and C compilers is the length of dis¬ 
criminated identifiers. On some older systems, an identifier with more than 8 charac¬ 
ters will be accepted, but only the first 8 characters will be used. The remaining 
characters are simply disregarded. On such a system, for example, the variable names 

i_am_an_identifier and i_am_an_elephant 
would be considered the same. 

In ANSI C, at least the first 31 characters of an identifier are discriminated. Many C 
systems discriminate more. 
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Decimal integers are finite strings of decimal digits. Because C provides octal and 
hexadecimal integers as well as decimal integers, we have to be careful to distinguish 
between the different kinds of integers. For example, 17 is a decimal integer constant, 
017 is an octal integer constant, and 0x17 is a hexadecimal integer constant. (See Chap¬ 
ter 3, “The Fundamental Data Types,” for further discussion.) Also, negative constant 
integers such as -33 are considered constant expressions. 

decimaLinteger 0 | positive^decimaLinteger 
positive_decimaLinteger ::= positive digit {digit } 0+ 
positive_digit ::=1|2|3|4|5|6|7|8|9 

Some examples of constant decimal integers are 

0 

77 

123456789000 /* too large for the machine? */ 

but not 

0123 /* an octal integer */ 

-49 /* a constant expression */ 

123.0 /* a floating constant */ 

Although we have already used integer constants such as 144 and floating constants 
such as 39.7, their meaning in terms of type, along with details concerning memory 
requirements and machine accuracy, is complicated enough to require a thorough dis¬ 
cussion, We do this in Chapter 3, “The Fundamental Data Types.” 


2.7 String Constants 

A sequence of characters enclosed in a pair of double-quote marks, such as ”abc”, is a 
string constant, or a string literal. It is collected by the compiler as a single token. In 
Section 6.10, “Strings,” on page 270, we will see that string constants are stored by the 
compiler as arrays of characters. String constants are always treated differently from 
character constants. For example, ' a f and "a" are not the same. 

Note that a double-quote mark " is just one character, not two. If the character " 
itself is to occur in a string constant, it must be preceded by a backslash character \ . If 
the character \ is to occur in a string constant, it too must be preceded by a backslash. 
Some examples of string constants are 


2.8 Y Operators and Punctuators 
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s|g 1 piigfitgfi 

string with double quotes \" within” 

1 “ a single backslash \\ is in this string” 


string of text 

/* the null string */ 
11 /* a string of blank characters */ 

»» a - p + c;” /* nothing is executed */ 

«»/* this is not a comment */" 

• . _! ~ ~ -i- ~ _ \ H ..Jj-U.'-ll 


II 


§ but not 

"this is not a string” */ 

"and 

neither is this” 

Character sequences that would have meaning if outside a string constant are just a 
sequence of characters when surrounded by double quotes. In the previous examples, 
« firing contains what appears to be the statement a = b + c;, but since it occurs 
surrounded by double quotes, it is explicitly this sequence of characters. 

Two string constants that are separated only by white space are concatenated by the 
| compiler into a single string. Thus, 


. 


"abc" "def" 


is equivalent to 


"abcdef" 


This is a new feature of the language available in ANSI C, but not in traditional C. 

String constants are treated by the compiler as tokens. As with other constants, the 
compiler provides the space in memory to store string constants. We will emphasize 
this point again in Section 6.10, “Strings,” on page 270, when we discuss strings and 
pointers. 




Operators and Punctuators 


In C, there are many special characters with particular meanings. Examples include the 
§g£i arithmetic operators 


p : which stand for the usual arithmetic operations of addition, subtraction, multiplication, 
B division, and modulus, respectively. Recall that in mathematics the value of a modulus 
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b is obtained by taking the remainder after dividing a by b. Thus, for example, 5 % 3 
has the value 2, and 7 % 2 has the value 1. 

In a program, operators can be used to separate identifiers. Although typically we 
put white space around binary operators to heighten readability, this is not required. 


a+b 

a_b 


/* this is the expression a plus b */ 
/* this is a 3-character identifier */ 


Some symbols have meanings that depend on context. As an example of this, consider 
the % symbol in the two statements 


printf("%d", a); 


and 


b % 7 


The first % symbol is the start of a conversion specification, or format, whereas the sec¬ 
ond % symbol represents the modulus operator. 

Examples of punctuators include parentheses, braces, conunas, and semicolons. Con¬ 
sider the following code: 

int main(void) 

{ 

int a, b = 2, c = 3; 
a = 17 * (b + c); 


The parentheses immediately following mai n are treated as an operator. They tell the 
compiler that mai n is the name of a function. After this, the symbols “{”, “, ”, “; ”, “(”, 
and “)” are punctuators. 

Both operators and punctuators are collected by the compiler as tokens, and along 
with white space, they serve to separate language elements. 

Some special characters are used in many different contexts, and the context itself 
can determine which use is intended. For example, parentheses are sometimes used to 
indicate a function name; at other times they are used as punctuators. Another example 
is given by the expressions 


a + b 


++a 


a += b 


They all use + as a character, but ++ is a single operator, as is +=. Having the meaning of 
a symbol depend on context makes for a small symbol set and a terse language. 


2.9 Precedence and Associativity of Operators 

Operat ors have rules of precedence and associativity that are used to determine how 
expressions are evaluated. These rules do not fully determine evaluation because the 
compiler has leeway to make changes for its own purposes. Since expressions inside 
parentheses are evaluated first, parentheses can be used to clarify or change the order 
ih which operations are performed. Consider the expression 


ISf®!! 




1 + 2*3 


In C, the operator * has higher precedence than +, causing the multiplication to be per¬ 
formed first, then the addition. Hence, the value of the expression is 7. An equivalent 
expression is 

H§i +(2*3) 

On the other hand, because expressions inside parentheses are evaluated first, the 
ression 




+ 2) * 3 

different; its value is 9. Now consider the expression 
1 + 2 - 3 + 4 - 5 

Because the binary operators + and - have the same precedence, the associativity rule 
Heft to right” is used to determine how it is evaluated. The “left to right” rule means 
that the operations are performed from left to right. Thus, 

ij((l + 2) - 3) + 4) - 5 
is an equivalent expression. 

The following table gives the rules of precedence and associativity for some of the 
erators of C. In addition to the operators we have already seen, the table includes 
operators that will be discussed later in this chapter. 
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Operator precedence and associativity 


Operator 


C) ++ C postfix) 

+ ( unary ) - {unary) 


- ( postfix ) 
++ ( prefix ) 


( prefix) 


etc, 


Associativity 

left to right 
right to left 
left to right 
left to right 
right to left 


All the operators on a given line, such as *, /, and %, have equal precedence with 
respect to each other, but have higher precedence than all the operators that occur on 
the lines below them. The associativity rule for all the operators on a given line appears 
at the right side of the table. Whenever we Introduce new operators, we will give their 
rules of precedence and associativity, and often we will encapsulate the information by 
augmenting this table. These rules are essential information for every C programmer. 

In addition to the binary plus, which represents addition, there is a unary plus, and 
both these operators are represented by a plus sign. The minus sign also has binary and 
unary meanings. Note carefully that the unary plus was introduced with ANSI C. There 
is no unary plus in traditional C, only unary minus. 

From the preceding table we see that the unary operators have higher precedence 
than the binary plus and minus. In the expression 

- a * b - c 

the first minus sign is unary and the second binary. Using the rules of precedence, we 
see that the following is an equivalent expression: 

((- a) * b) - c 


2.10 ▼ Increment and Decrement Operators 
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10 Increment and Decrement Operators 
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Ijpincrement operator ++ and the decrement operator — are unary operators. They 
Jjlle unusual because they can be used as both prefix and postfix operators. Suppose 
ISjtllfis a variable of type i nt. Then both ++val and val++ are valid expressions, with 
§lp/al illustrating the use of ++ as a prefix operator and val ++ illustrating the use of ++ 
§j|a postfix operator. In traditional C, these operators have the same precedence as the 
IfMffigr unary operators. In ANSI C, for technical reasons they have the very highest pre- 
ffllclence and left-to-right associativity as postfix operators, and they have the same pre- 
Ifljilence as the other unary operators and right-to-left associativity as prefix operators. 
Both ++ and — can be applied to variables, but not to constants or ordinary expres- 


V' 

\rr A -■ . 


r'lfflpns. Moreover, different effects may occur depending on whether the operators 
!|J| prefix or postfix position. Some examples are 
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j§§777++ 
U+++(a * b 


/* constants cannot be incremented */ 

1) /* ordinary expressions cannot be incremented */ 
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Each of the expressions ++i and i++ has a value; moreover, each causes the stored 
jtailiie of i in memory to be incremented by 1. The expression ++i causes the stored 
flaliie of i to be incremented first, with the expression then taking as its value the new 
Isfpred value of i. In contrast, the expression i ++ has as its value the current value of i; 
llpi the expression causes the stored value of i to be incremented. The following code 
illustrates the situation: 


:f\- 


»:«5,rA’i'lAxvf’: \ 


int 


a, b, c 


gif a = ++c; 

b = C++; 

+ printf("%d 


%d %d\n' ! 1 a, b, ++c) 


/* 1 1 3 is printed */ 
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87 


In a similar fashion, the expression —i causes the stored value of i in memory to be 
decremented by 1 first, with the expression then taking this new stored value as its 
value. With i —, the value of the expression is the current value of i; then the expres¬ 
sion causes the stored value of i in memory to be decremented by 1. 

Note carefully that ++ and — cause the value of a variable in memory to be changed. 
Other operators do not do this. For example, an expression such as a + b leaves the 
values of the variables a and b unchanged. These ideas are expressed by saying that the 
operators ++ and — have a side effect, not only do these operators yield a value, they 
also change the stored value of a variable in memory. (See exercise 14, on page 99.) 

In some cases, we can use ++ in either prefix or postfix position, with both uses pro¬ 
ducing equivalent results. For example, each of the two statements 

++i; and i++; 

is equivalent to 

i = i +1; 

In simple situations, one can consider ++ and — as operators that provide concise nota¬ 
tion for the incrementing and decrementing of a variable. In other situations, careful 
attention must be paid as to whether prefix or postfix position is desired. 


Declarations and initializations 

int a = 1, b=2, c=3, d=4; 

Expression 

Equivalent expression 

Value 

a * b / c 

(a * b) / c 

0 

a * b % c + 1 

((a * b) % c) + 1 

3 

++ a * b - c -- 

(C++ a) * b) - (c --) 

1 

7 — —> h * 4 - 4 - rt 

7 - ((- b) * C++ d)) 

17 
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Assignment Operators 


l^ftchange the value of a variable, we have already made use of assignment statements 
las 



fea = b + c 

other languages, C treats = as an operator. Its precedence is lower than all the 
lllp afors we have discussed so far, and its associativity is right to left. In this section 
explain in detail the significance of this. 

-To understand = as an operator, let us first consider + for the sake of comparison. 
IRUehinarv operator + takes two operands, as in the expression a + b. The value of the 
Pfflffession is the sum of the values of a and b. By comparison, a simple assignment 
y^ tession is of the form 

right_side 


8# 


« 


m 


variable 

ami: 




m 


S|®iiLfere right-side is itself an expression. Notice that a semicolon placed at the end would 
made this an assignment statement. The assignment operator = has the two oper- 
ancls variable and right_side. The value of righLside is assigned to variable, and that 
value becomes the value of the assignment expression as a whole. To illustrate this, 
BJItlisider the statements 


.... ... ... 

itet t : 

> - 

; -'^v> 


■ -V. v w 

. 






b 

c 

a 


= 2 ; 

= 3; 

— h 4 ~ 
«— p.J r 


jr* * 
v II 


gf$yhere the variables are all of type i nt. By making use of assignment expressions, we 
Sifan condense this to 


(b = 2) + (c = 3); 


j|g|The assignment expression b 


2 assigns the value 2 to the variable b, and the assign- 
Mfig nt expression itself takes on this value. Similarly, the assignment expression c = 3 


m 


|Sssigns the value 3 to the variable c, and the assignment expression itself takes on this 


toft 


Finally, the values of the two assignment expressions are added, and the result- 
IBjlng value is assigned to a. 


V- ' • 
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Although this example is artificial, there are many situations where assignment 
occurs naturally as part of an expression. A frequently occurring situation is multiple 
assignment. Consider the statement 

a = b = c = 0 ; 

Because the operator = associates from right to left, an equivalent statement is 
a = (b = (c = 0 )); 

First, c is assigned the value 0, and the expression c = 0 has value 0. Then b is assigned 
the value 0, and the expression b = (c = 0) has value 0. Finally, a is assigned the value 
0, and the expression a = (b = (c = 0 )) has value 0. Many languages do not use 
assignment in such an elaborate way. In this respect C is diflerent. 

In addition to =, there are other assignment operators, such as += and -= . An expres¬ 
sion such as 

k = k + 2 

will add 2 to the old value of k and assign the result to k, and the expression as a whole 
will have that value. The expression 

k += 2 

accomplishes the same task. The following table contains all the assignment operators: 

Assignment operators • 

= += -= *= /= %= »= «= &= A = 1 = 

All these operators have the same precedence, and they all have right-to-left associativ¬ 
ity’. The semantics is specified by 

variable op= expression 
which is equivalent to 

variable = variable op ( expression ) 

with the exception that if variable is itself an expression, it is evaluated only once. 
When dealing with arrays, this is an important technical point. Note carefully that an 
assignment expression such as 



,*= k + 3 
than 


is equivalent to 


2.12 T An Example: Computing Powers of 2 


(k + 3) 


— j * k + 3 

allowing table illustrates how assignment expressions are evaluated: 


int 


mmm 


arations and initializations 


1 = 1, j = 2, k = 3, m = 4; 



Equivalent expression 

Equivalent expression 

i += (j + k) 

i = Ci + (j + k)) 

j *= (k = (m + 5)) 

j = (j * (k = (m + 5))) 


Value 




trip 





An Example: Computing Powers of 2 


illustrate some of the ideas presented in this chapter, we will write a program that 
on a line some powers of 2. Here is the program: 


Jp Some powers of 2 are printed. */ 
#include <stdio.h> 
ant main (void) 

ilf 

»:% int i = 0, power = 1; 


while (++i <= 10) 

printf("%-6d", power * 
printf("\n"); 
return 0: 


e output of the program is 


EH 
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Dissection of the pow_of_2 Program 


■ /* Some powers of 2 are printed. */ 

Programs often begin with a comment explaining the program’s intent or use. In a large 
program, comments may be extensive. The compiler treats comments as white space. 


■ #include <stdio.h> 

The header file stdio.h contains the function prototype for the printf () function, lhis 
is a kind of declaration for pri ntf (). The compiler needs it to do its work correctly. 
(See Section 2.13, “The C System,” on page 91, for further details.) 


■ int i = 0, power = 1; 

The variables i and power are declared to be of type i nt. They are initialized to 0 and 
1, respectively. 


■ while (++i <= 10) 

As long as the value of the expression ++i is less than or equal to 10, the body of the 
while loop is executed. The first time through the loop, the expression ++i has the 
value 1; the second time through the loop, ++i has the value 2; and so forth. Thus, the 
body of the loop is executed ten imes. 


■ printf("%-6d", power *= 2); 

The body of the whi 1 e loop consists of this statement. The string constant "%-6d" is 
passed as the first argument to the pri ntf O function. The string contains the lormat 
%-6d, which indicates that the value of the expression power *= 2 is to be printed as a 
decimal integer with field width 6. The minus sign indicates that the value is to be left 
adjusted in its field. 


■ power *= 2 

This assignment expression is equivalent to power = power * 2 which causes the old 
value of power to be multiplied by 2 and the resulting value to be assigned to power. 
The value assigned to power is the value of the assignment expression as a whole. 



2.1 3 ▼ The C System 


iigjjie first time through the whi 1 e loop, the old value of power is 1, and the new value is 
!§fg|tlie second time through the loop, the old value of power is 2, and the new value is 4, 
Ssand s0 forth - 


Jp 3 The c System 

The C system consists of the C language, the preprocessor, the compiler, the library, 
and other tools useful to the programmer, such as editors and debuggers. In this sec- 
I Hit we discuss the preprocessor and the library. (See Chapter 8, “The Preprocessor,” 
Tpr further details about the preprocessor. See Appendix A, “The Standard Library,” for 
§§j|||gtails about functions in the standard library.) 


The Preprocessor 


fines that begin with a # are called preprocessing directives. These lines communicate 
with the preprocessor. In traditional C, preprocessing directives were required to begin 
BBillg olumn 1. In ANSI C, this restriction has been removed. Although a # may be pre- 
|||p|i ed on a 11116 by white space, it is still a common programming style to start prepro- 
ig|s!'sing directives in column 1. 

•fSillWe have already used preprocessing directives such as 

Jlfc 

g #i nclode <stdio.h> and #define PI 3.14159 

ww p 

-~8J§Siother form of the #i ncl ude facility is given by 

Pjfifife 1 

ncl ude "filename" 

ggj|§i? causes the preprocessor to replace the line with a copy of the contents of the 
lf§Sl leci file - A search for the file is made first in the current directory and then in other 
gjfystem-dependent places. With a preprocessing directive of the form 


§? #i ncl ude < filename > 

;w». vigil 

Jhe preprocessor looks for the file only in the “other places” and not in the current 
directory. 

Because #incl ude directives commonly occur at the beginning of the program, the 
igjude files they refer to are called header files, and a . h is used to end the file name. 
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This is a convention; the preprocessor does not require this. There is no restriction on 
what an include file can contain. In particular, it can contain other preprocessing direc¬ 
tives that will be expanded by the preprocessor in turn. Although files of any type may 
be included, it is considered poor programming style to include files that contain the 
code for function definitions. (See Chapter 5, “Functions.”) 

On UNIX systems, the standard header files such as stdio.h are typically found in the 
directory /usr/include. On Borland C systems, they might be found in the directory 
c:\bc\include or in some other directory. In general, the location of the standard 
#i ncl ude files is system-dependent. All of these files are readable, and programmers, 
for a variety of reasons, have occasion to read them. 

One of the primary uses of header files is to provide function prototypes. For exam¬ 
ple, the file stdio.h contains the following lines: 

int printf(const char *format, ...); 

int scanf(const char ^format, ...); 

These are the function prototypes for the pri ntf () and scanf () functions in the stan¬ 
dard library. Roughly speaking, a function prototype tells the compiler the types of the 
arguments that get passed to the function and the type of the value that gets returned 
by the function. Before we can understand the function prototypes for pri ntf () and 
scanf (), we need to learn about the function-definition mechanism, pointers, and type 
qualifiers. These ideas are presented in later chapters. The point we are making here is 
that when the programmer uses a function from the standard library, then the corre¬ 
sponding standard header file should be included. The header file will provide the 
appropriate function prototype and other necessary constructs. The compiler needs the 
function prototype to do its work correctly. 

The Standard Library 

The standard library contains many useful functions that add considerable power and 
flexibility to the C system. Many of the functions are used extensively by all C program¬ 
mers, whereas other functions are used more selectively. Most programmers become 
acquainted with functions in the standard library on a need-to-know basis. 

Programmers are not usually concerned about the location on the system of the stan¬ 
dard library because it contains compiled code that is unreadable to humans. The stan¬ 
dard library may comprise more than one file. The mathematics library, for example, is 
conceptually part of the standard library, but it often exists in a separate file. (See exer¬ 
cise 25, on page 105, and exercise 26, on page 105.) Whatever the case, the system 
knows where to find the code that corresponds to functions from the standard library, 
such as pri ntf () and scanf (), that the programmer has used. Note carefully, how¬ 
ever, that even though the system provides the code, it is the responsibility of the pro- 
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Sprner to provide the function prototype. This is usually accomplished by including 
header files. 


WjjilUtion: Do not mistake header files for the libraries themselves. The standard 
^BKry contains object code of functions that have already been compiled. The stan- 
Jra &WB .header files do not contain compiled code. 
a |ra lo'.aTi illustration of the use of a function in the standard library, let us show how 
mmO can be used t0 g enerate some randomly distributed integers. In later chapters 
have occasion to use rand() to fill arrays and strings for testing purposes, 
g we use it to print some integers on the screen. 






is* prn - rand - c 




nc i ude <stdi o. h> 


#i nclude <stdlib.h> 


ragbag j >>: - 

IV llll tint main (void) 


mumd 



int 


n 


l 




: 

f : ' ' .iPw 

...... v-Vfr.xg- 

MfKEiK. 
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printf("\n%s\n%s", 

"Some randomly distributed integers will be printed 
"How many do you want to see? "); 
scanf("%d", &n); 
for (i =0; i < n; ++i) { 
if (i % 10 == 0) 
putchar(’\n’); 
printf("%7d", rand()); 

} 

printf("\n\n"); 
return 0; 


I 

ffyt A < 




m 


|j||tt^pose that we execute the program and type 19 when prompted. Here is what 
J|j§p.pears on the screen: 


_ > 5 ' V... ’ 

v- - , V.V/V 


JlgjlfSoine randomly distributed integers will be printed 
||§jjjjjHow many do you want to see? 23 






MB 

16838 

5758 

10113 

17515 

31051 

5627 

23010 

7419 


16212 

4086 

2749 

12 767 

9084 

12060 

32225 

17543 


25089 

21183 

25137 

25566 

26966 

4978 

20495 
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Dissection of the prn_rand Program 


a #include <stdio.h> 
finclude <stdlib.h> 

These header files are included because of the function prototypes they contain. In par¬ 
ticular, the function prototype 

f 

int rand(void); 

is in stdlib.h. It tells the compiler that randQ is a function that takes no arguments and 
returns an i nt value. Rather than include stdlib.h, we could instead supply this line our¬ 
selves at the top of the file just before mai n(). However, it is both easier and safer to 
include the header file. 

■ printf("\n%s\n%s", _ . 

"Some randomly distributed integers will be printed. , 

"How many do you want to see? "); 
scanf("%d", &n); 

A prompt to the user is printed on the screen. The characters typed in by the user are 
received by scanf (), converted in the format of a decimal integer, and placed at the 
address of n. 

■ for (i = 0; i < n; ++i) { 

i i * t « 

} 

This is a for loop. It is equivalent to 
i = 0; 

while’(i < n) { 

++i; 


} 



2.13 T The C System 


way to write this program would be to initialize i to zero and then use the con¬ 


ch 


while (i++ < n) { 


“itcarefully that i ++ < n is different from ++i < n. (See exercise 19, on page 101.) 

: 'if (i % 10 == 0) 
putchar(’\n'); 
jprintf ("%7d" , randQ) ; 

operator == is the “is equal to” operator, If exprl and expr2 are two expressions 
M y, rig the same value, then the expression exprl == expr'2 will be true, otherwise it will 
■false. In Section 4.3, “Equality Operators and Expressions,” on page 152, we will see 
has lower precedence than % . Thus, 


i % 10 


is equivalent to 


(i % 10) 


Iff first time through the loop and every eighth time thereafter, the expression as a 
fj|leis true. Whenever the expression is true, a newline character gets printed. 


■ilB hrintf ("%7d". randO); 

ry time through the loop, the value returned by the call to randQ is printed in the 
at of a decimal integer. The width of the field where the integer gets printed is 7. 




most uses of randQ, the programmer needs to seed the random-number generator 
are it gets used. This can be done with the following line: 

srand(time(NULL)); 

i exercise 22, on page 102, for further discussion. Also, see exercise 25, on page 105, 
the use of other random-number generators.) 
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Summary 

1 Tokens are the basic syntactic units of C. They include keywords, identifiers, con¬ 
stants, string constants, operators, and punctuators. White space, along with opera¬ 
tors and punctuators, can serve to separate tokens. For this reason, white space, 
operators, and punctuators are collectively called separators. White space, other 
than serving to separate tokens, is ignored by the compiler. 

2 Comments are enclosed by the bracket pair /'■' and */ and are treated as white 
space by the compiler. They are critical for good program documentation. Com¬ 
ments should assist the reader to both use and understand the program. 

3 A keyword, also called a reserved word, has a strict meaning. There are 32 key¬ 
words in C. They cannot be redefined. 

4 Identifiers are tokens that the programmer uses chiefly to name variables and func¬ 
tions. They begin with a letter or underscore and are chosen to be meaningful to the 

human reader. 

5 Some identifiers are already known to the system because they are the names of 
functions in the standard library. These include the input/output functions 
scanf () and printf () and mathematical functions such as sqrt(), si n(), cos(), 

and tan(). 

6 Constants include various kinds of integer and floating constants, character con¬ 
stants such as 'a' and and string constants such as "abc". All constants are 

collected by the compiler as tokens. 

7 String constants such as "deep blue sea" are arbitrary sequences of characters, 
including white-space characters, that are placed inside double quotes. A string 
constant is stored as an array of characters, but it is collected by the compiler as a 
single token. The compiler provides the space in memory needed to store a string 
constant. Character constants and string constants are treated differently. For 
example, 'x‘ and "x" are not the same. 


▼ Summary 



^Operators and punctuators are numerous in C. The parentheses that follow mai n in 
lithe code 

jjjjHnt main (void) 

IfsSly? X 

. ' v. 

Jw Y _ —7^ > 

» ■ * » « 

W constitute an operator; they tell the compiler that mai n is a function. The parenthe- 
I* ses in the expression a * (b + c) are punctuators. The operations inside the 
parentheses are done first. 


-iv- -V.v': 


Hire, the rules of precedence and associativity for operators determine how an 
Suppression gets evaluated. The programmer needs to know them. 


j^a^The increment operator ++ and the decrement operator — have a side effect. In 
||8l§fiaddition to having a value, an expression such as ++i causes the stored value of i 
memory to be incremented by 1. 


WWfi The operators ++ and — can be used in both prefix and postfix positions, possibly 

3 Bipith different effects. The expression ++i causes i to be incremented in memory, 
||and the new value of i is the value of the expression. The expression i ++ has as its 
7 value the current value of i, and then the expression causes i to be incremented in 
.Ilmemory. 








In C, the assignment symbol is an operator. An expression such as a = b + c 
assigns the value of b + c to a, and the expression as a whole takes on this value. 
Although the assignment operator in C and the equal sign in mathematics look 
alike, they are not comparable. 

The standard library contains many useful functions. If the programmer uses a 
function from the standard library, then the corresponding standard header file 
should be included. The standard header file provides the appropriate function 
prototype. 
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. . — . . . mmmmmm m mmmm . 

Pvo *£•£%#* 

L.ACI 

1 Is mai n a keyword? Explain. 

2 List five keywords and explain their use. 

3 Give two examples of each of the six types of tokens. 

4 Which of the following are not identifiers and why? 

34 d _yes o_no_o_no 00_go star*it 

l_i_am one_i_aren"t me_to-2 xYshouldl int 

5 Design a standard form of introductory comment that will give a reader informa¬ 
tion about who wrote the program and why. 

6 Take a symbol such as + and show the different ways it can be used in a program. 

7 ANSI C does not provide for the nesting of comments, although many compilers 
provide an option for this. Try the following line on your compiler and see what 
happens: 

/* This is an attempt /* to nest V a comment. V 

8 The ANSI C committee is considering adding the C++ comment style to the C lan¬ 
guage, and some compilers already accept this. If you put a line such as 

// Will this C++ style comment work in C? 

in one of your working C programs, what happens on your system? Even if your C 
compiler does accept this comment style, you should he aware that many compilers 
do not. If you want your code to be portable, do not use C++ style comments. 

9 Write an interactive program that converts pounds and ounces to kilograms and 
grams. Use symbolic constants that are defined at the top of the file outside of 
mai n(). 

10 This exercise illustrates one place where white space around operators is impor¬ 
tant. The expression a+++b can be interpreted as either 
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Declarations and initializations 


int a = 2, b = -3, c=5,d 


7, e = 11; 


Expression 


a / b / c 

7 + c * - 


d / e 


2*a%-b + c + l 
39 / - ++ e - + 29 % c 
a += b += c +— 1+2 
7 - + ++ a % (3 + b) 


17 Consider the following code: 


Equivalent expression 


(a / b) / c 
7 + ((c * ( 


d)) / e) 


Value 


error 


i nt 


1, b = 2, c 


a += b += c += 7; 

Write an equivalent statement that is fully parenthesized. What are the values of 
the variables a, b, and c? First write down your answer. Then write a test program 
to check your answer. 

18 A good programming style includes a good layout style, and a good layout style is 
crucial to the human reader, even though the compiler sees only a stream of charac¬ 
ters. Consider the following program: 

int main(void 
){float qx, 

id 

tt;printf("gimme 3" 

);scanf 

( "%f%f %f",&qx,&zz 

,&tt);printf("averageis=%f", 

(qx+tt+zz)/3.0);return 
0 
;} 

Although the code is not very readable, it should compile and execute, lest it to see 
if that is true. Then completely rewTite the program. Use white space and comments 
to make it more readable and well documented. Hint: Include a header file and 
choose new identifiers to replace qx, zz, and tt. 


mt-. ft mm 




▼ Exercises 




a§BS ~- __ , - 

K&T. .. ■ 


Rewrite the prn_rand program in Section 2.13, “The C System,” on page 93, replac- 
® the for loop with the following whi 1 e loop: 



... whi1e (i++ < n) { 





After you get your program rumring and understand its effects, rewrite the pro- 
am, changing 


j.v;; 

b:-- ■ -- ' " T: 



i++ < n 


++i < n 




mm : Wmrnm 

sir? 

lig&xSS&aa iSSSsssS2852552! 



§§®|51 


§@851.;,“*. igSpgaSjiAjja.'gigiifei 


Igtear ■ ■' li. 

■ zy.y -- 


Hgg 

i, « 


n.ai 


!lpg ^ 

■life 'taMa^gS 


S'v- ■ ■ | 

Ba , fi-fT' *1 



Now the program will behave differently. To compensate for this, rewrite the body 
-the whi 1 e loop so that the program behaves exactly as it did in the beginning. 

Xhe integers produced by the function rand() all fall within the interval [0, n\, 
|vjiere n is system-dependent. In ANSI C, the value for n is given by the symbolic 
constant RAND_MAX, which is defined in the standard header file stdlib.h. Write a 
program that prints out the value of RAND_MAX on your system. Hint: Include the 
header file stdlib.h and use the line 

printf("RAND_MAX = %d\n", RAND_MAX); 

If possible, run your program on a number of different C systems. You will proba¬ 
bly find that RAND_MAX has the same value on all systems. The reason for this is 
that the ANSI C committee suggested how the function rand() could be imple¬ 
mented, and most compiler writers followed the committee’s suggestions verbatim. 
It has been our experience that C systems on PCs, UNIX workstations, and even the 
Cray supercomputer in San Diego all use the same value for RAND_MAX and that on 
all of these systems rand() produces the same output values. (See exercise 25, on 
age 105, for further discussion.) 

Run the prn_rand program in Section 2.13, “The C System,” on page 9.3, three times 
to print out, say, 100 randomly distributed integers. Observe that the same list of 
numbers gets printed each time. For many applications, this is not desirable. Mod¬ 
ify the prn_rand program by using srandQ to seed the random-number generator. 
The first few lines of your program should look like 
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#include <stdio.h> j 

#include <stdlib.h> f 

#include <ti me . h> | 

int main(void) 

{ 1 

int i, n, seed; ,3 

seed = time(NULL); 
srand(seed); 
printf("\n%s\n%s", 

"Some randomly distributed integers will be printed.", 

"How many do you want to see? "); 

• « ■ a * 

On most C systems, the function call time (NULL) returns the number of elapsed 
seconds since 1 January 1970. (See Appendix A, “The Standard Library.”) We store 
this value in the variable seed, and then we use the function call s rand (seed) to 
seed the random-number generator, Repeated calls to randQ will generate all the 
integers in the interval [0, RAND_MAX], but in a mixed-up order. The value used to 
seed the random-number generator determines where in the mixed-up order 
randQ will start to generate numbers. If we use the value produced by time() as a 
seed, then the seed will be different each time we call the program, causing a differ¬ 
ent set of numbers to be produced. Run this program repeatedly. You should see a 
different set of numbers printed each time. Do you? 

22 In the previous exercise, we suggested the code 

seed = time(NULL); 
srand(seed); 

In place of these lines, most programmers would write 
srand(time(NULL)); 


Make this change to your program, and then compile and execute it to see that it 
behaves the same as before. 
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the two previous exercises we used the value returned by time() to seed the ran- 
^Bppom-number generator. In this exercise we want to use the value returned by 
me() to measure the time it takes to call randQ. Here is one way this can be 

llilill done: 

iRr 

^^^Sinclude <stdio.h> 

^^^rfinclude <stdlib.h> 

^^^#iinclude <time.h> 

PPM J; 


mm 
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fdefine 
#define 
#define 


NCALLS 

NCOLS 

NLINES 


10000000 


/* number of fet calls */ 
/* number of columns */ 
/* number of lines */ 


m. 
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^^fc|*int main (void) 
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int i, val; 
long begin, diff, end; 

begin = time(NULL); 
srand(time(NULL)); 

printf("\nTIMING TEST: %d calls to rand()\n\n", NCALLS)- 
for (i =1; i <= NCALLS; ++i) { 
val = randQ; 

if (i <= NCOLS * NLINES) { 
printf("%7d", val); 
if (i % NCOLS == 0) 
putchar('\n'); 

} 

else if (i == NCOLS * NLINES + 1) 
printf("%7s\n\n", "."); 

} 

end = time(NULL); 
diff = end - begin; 

printf("%s%ld\n%s%ld\n%s%ld\n%s%.10f\n\n", 



end time: 
begin time: 
elapsed time: 
"time for each call: 
return 0; 


end, 
begin, 
diff, 

(double) diff / NCALLS); 
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Here is the output on our system: 


TIMING TEST: 10000000 

calls to 

randQ 




11753 

m mut» ■* 9 

5493 

23634 

23237 

24988 

31011 

19570 

432 12211 

9712 

30284 

31480 

JL 

30292 

13830 

27126 17405 

4877 

19045 

7305 

1114 

28874 


end time: 868871916 






begin time: 868871900 





elapsed time: 16 





time for 

each call: 0. 

0000016000 





The intent of this program is to print out some of the values produced by the call to 
randQ but not all of them. After all, looking at ten million numbers on the screen 
is not too interesting. Experiment with this program by modifying some of the 
#defi nes so that you can see what their effects are. For example, try making the 
following changes: 


#defi ne 

NCALLS 

1000 

/ 4 .** 
/" 

number 

of 

fet calls */ 

#define 

NC0LS 

7 

/* 

number 

of 

columns */ 

#define 

NLINES 

7 


number 

of 

lines */ 


Caution: If you are on a time-shared machine, then the use of values returned by 
time() to time things can be misleading. Between your calls to timeQ, the 
machine may be servicing other requests, making your timing results inaccurate. 
The proper way to time C code is with the use of the cl ock() function. (See Section 
11.16, “How to Time C Code,” on page 528.) 


24 The function randQ returns values in the interval 10, RAND_MAX], (See exercise 20, 
on page 101.) If we declare the variable median of type double and initialize it to 
have the value RAND_MAX/2.0, then randQ will return a value that is sometimes 
larger than median and sometimes smaller. On average, there should be as many 
values that are larger as there are values that are smaller. Test this hypothesis. 
Write a program that calls randQ, say 500 times, inside a for loop, increments the 
variable above_cnt every time randQ returns a value larger than median, and 
increments the variable below_cnt every time randQ returns a value less than 
median. Each time through the for loop, print out the value of the difference of 
above_cnt and below.cnt. This difference should oscillate about zero. Does it? 



▼ Exercises 


US®!* 


jggljp this exercise we continue with the discussion started in exercise 20, on page 101. 
Jjferay call to randQ produces a value in the interval [0, RAND_MAX], and RAND_MAX typi- 
i§©cally has the value 32767. Since this value is rather small, randQ is not useful for 
iSSStjnanv scientific problems. Most C systems on UNIX machines provide the program- 
with the rand48 family of random-number generators, so called because 48-bit 

-T< 

«p|£%ithmetic gets used to generate the numbers. The function drand48Q, for exam- 
■*H£$-ple, can be used to produce randomly distributed doubles in the range [0,1], and 
ifejhe function 1 rand48Q can be used to produce randomly distributed integers in 
jfK|§|the range [0, 2 31 - 1]. Typically, the function prototypes for this family of functions 
in stdlib.h. Modify the program that you wrote in exercise 20, on page 101, to 




WSMB:':: 





/Smse 1 rand48Q in place of randQ and srand48Q in place of srandQ. You will see 
JQthat, on average, larger numbers are generated. Whether the numbers are better 
ItjESdepends on the application. To find out more about pseudo random-number gener- 
Ijlliators, consult the text Numerical Recipes in C by William Press et al. (Cambridge, 
aBfefcngland: Cambridge University Press, 1992), pages 274-328. 
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value of expressions such as ++a + a++ and a += ++a are system-dependent, 
*^^d)ecause the side effects of the increment operator ++ can take place at different 
„..Mutinies. This is both a strength and a weakness of C. On the one hand, compilers can 
do what is natural at the machine level. On the other hand, because such an expres- 
Ssion is system-dependent, the expression will have different values on different 
“^pinachines. Experienced C programmers recognize expressions such as this to be 
IS potentially dangerous and do not use them. Experiment with your machine to see 
KfrittWhat value is produced by ++a + a++ after a has been initialized to zero. Unfortu- 
f§t/nately, many compilers do not warn about the danger of such expressions. What 
-•^^fsrhappens on your system? 
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^libraries on a UNIX system typically end in .a, which is mnemonic for “archive,” 
whereas libraries in Win 95/NT systems typically end in .lib. See if you can find the 
'al standard C libraries on your system. These libraries are not readable by humans. 




■gap On a UNIX system you can give a command such as 


ar t /usr/lib/libc.a 



^1, W, f/y <|JV , 

fe; to see all the titles (names) of the objects in the library. If you do not see any math- 
"Tjematical functions, then the mathematics library is probably in a separate file. Try 

Jlthe command 

Hit 




ar t /usr/lib/libm.a 
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28 In both ANSI C and traditional C, a backslash at the end of a line in a string constant 
has the effect of continuing it to the next line. Here is an example of this: 

"by using a backslash at the end of the line \ 
a string can be extended from one line to the next" 

Write a program that uses this construct. Many screens have 80 characters per line. 
What happens if you try to print a string with more than 80 characters? 

29 In ANSI C, a backslash at the end of any line is supposed to have the effect of con¬ 
tinuing it to the next line. This can be expected to work in string constants and 
macro definitions on any C compiler, either ANSI or traditional. (See the previous 
exercise.) However, not all ANSI C compilers support this in a more general way. 
After all, except in macro definitions, this construct gets little use. Does your C 
compiler support this in a general way? Try the following: 

#i nc\ 

lude <stdio.h> 

int mai\ 
n(void) 

{ 

printf("Will this work?\n"); 
ret\ 
urn 0; 

} 

30 When you invoke the compiler, the system first invokes the preprocessor. In this 
exercise we want to deliberately make a preprocessing error, just to see what hap¬ 
pens. Try the following program: 

#incl <stdixx.h> /* two errors on this line */ 

int main(void) 

{ 

printf("Try me.\n"); 
return 0; 

} 

What happens if you change #i ncl to #i ncl ude? 
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We begin this chapter with a brief look at declarations, expressions, and assignment. 
Then we give a detailed explanation for each of the fundamental data types, paying par- 
£ titular attention to how C treats characters as small integers. In expressions with oper¬ 
ands of different types, certain implicit conversions occur. We explain the rules for 
conversion and examine the cast operator, which forces explicit conversion. 


3.1 Declarations, Expressions, and Assignment 


*■■■'■ ..’M 


Variables and constants are the objects that a program manipulates. In C, all variables 
must be declared before they can be used. The beginning of a program might look like 
this: 


#include <stdio.h> 
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int main(void) 

{ 

int a, b, c; 

float x, y = 3.3, z 


7.7 


printf("Input two integers: "); 
scanf("%d%d", &b, &c); 
a = b + c; 
x = y + z; 


/* declaration */ 

/* declaration with 
initializations */ 

/* function call */ 

/* function call */ 

/* assignment */ 

/* assignment */ 
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Declarations associate a type with each variable that is declared, and this tells the com¬ 
piler to set aside an appropriate amount of space in memory to hold values associated 
with variables. This also enables tbe compiler to instruct the machine to perform speci¬ 
fied operations correctly. In the expression b + c, the operator + is being applied to 
two variables of type i nt, which at the machine level is a different operation than + 
applied to variables of type float, as occurs in the expression y + z. Of course, the 
programmer need not be concerned that the two + operations are mechanically dif fer¬ 
ent, but the C compiler has to recognize the difference and give the appropriate 
machine instructions. 

The braces { and > surround a block, which consists of declarations and statements. 
The declarations, if any, must occur before the statements. The body of a function defi¬ 
nition is a block, but as we shall see in Section 5.10, “Scope Rules,” on page 213, there 
are other uses for blocks. 

Expressions are meaningful combinations of constants, variables, operators, and 
function calls. A constant, variable, or function call by itself can also be considered an 
expression. Some examples of expressions are 

a + b 

sqrt(7.333) 

5.0 * x - tan(9.0 / x) 

Most expressions have a value. For example, the expression a + b has an obvious value, 
depending on the values of the variables a and b. If a has value 1 and b has value 2, 
then a + b has value 3. 

The equal sign = is the basic assignment operator in C. An example of an assignment 
expression is 

i = 7 

The variable i is assigned the value 7, and the expression as a whole takes that value as 
well. When followed by a semicolon, an expression becomes a statement, or more 
explicitly, an expression statement. Some examples of statements are. 


printf("The plot thickens!\n"); 

The following two statements are perfectly legal, but they do no useful work. Some 
compilers will issue warnings about such statements; others will not. 
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Let us consider a statement that consists of a simple assignment expression followed 
Sr » semicolon. It will have the following form: 


jj variable = expr ; 

First, the value of the expression on the right side of the equal sign is computed. Then 
that value is assigned to the variable on the left side of the equal sign, and this becomes 
the value of the assignment expression as a whole. (Statements do not have a value.) 
Note that here the value of the assignment expression as a whole is not used. That is 
perfectly all right. The programmer is not required to use the value produced by an 
expression. 

r Even though assignment expressions sometimes resemble mathematical equations, 
the two notions are distinct and should not be confused. The mathematical equation 


does not become an assignment expression by typing 


/* wrong */ 


Here, the left side of the equal sign is an expression, not a variable, and this expression 
may not be assigned a value. Now 7 consider the statement 

jj x = x + 1; 

The current value of x is assigned the old value of x plus 1. If the old value of x is 2, 
then the value of x after execution of the statement will be 3. Observe that as a mathe¬ 
matical equation 


x + 1 


s meaningless; after subtracting x from both sides of the equation, we obtain 


Caution: Although they look alike, the assignment operator in C and the equal sign in 
mathematics are not comparable. 
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3,3 T Characters and the Data Type char 


111 




3.2 The Fundamental Data Types 

C provides several fundamental data types, many oi which we have already seen. We uj| 
need to discuss limitations on what can be stored in each type. 



char 

signed short int 
unsigned short int 
float 


signed char 
signed int 
unsigned int 
double 


unsigned char 
signed long int 
unsigned long int 
long double 






These are all keywords They may not be used as names of variables. 01 course, char j 
Ifandstor S and i nt stands for -integer,- but only char and int can be used| 
as keywords. Other data types such as arrays, pointers, and structures are derived from 
the fundamental types. They are presented in later chapters. 

Usually, the keyword signed is not used. For example, signed int is equivalent tcjj| 
i nt, and because shorter names are easier to type, i nt is typically used. The type charts? 
however, is special in this regard. (See Section 3.3, “Characters and the Data Typet 
char,” on page 111 .) Also, the keywords short int, long int, and unsigned i nt may : g 
be, and usually are, shortened to just short, long, and unsigned, respectively. Thej| 
keyword signed by itself is equivalent to i nt, but it is seldom used in this context,^ 

With all these conventions, we obtain a new list. 


Fundamental data types j 

char 


signed char 

unsigned char 

short 


i nt 

long 

unsigned short 

unsigned 

unsigned long 

float 


double 

long double 




■m 
:: 




Let us assume that the category type is defined to be any one of the fundament i y 
types given in the preceding table. Using this category, we can provide the syntax o a 

simple declaration: 

declaration : := type identifier { , identifier }@ + , 


/Z. S' \ 


The fundamental types can be grouped according to functionality. The integral types 
jp : are those types that can be used to hold integer values; the floating types are those that 
sSJSlfcin be used to hold real values. They are all arithmetic types. 
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Fundamental types grouped by functionality 

Integral types 

char signed char unsigned char 

short int long 

unsigned short unsigned unsigned long 

Floating types 

float double long double 

Arithmetic types 

Integral types + Floating types 


mmp. These collective names are a convenience. In Chapter 6, for example, when we dis- 
Igjfcuss arrays, we will explain that only integral expressions are allowed as subscripts, 
ll&meaning only expressions involving integral types are allowed. 


matte 








Characters and the Data Type char 
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gg%In C, variables of any integral type can be used to represent characters. In particular, 
both char and i nt variables are used for this purpose. In some situations, an i nt may 
jjjjbe required for technical reasons. (See Section 3.9, “The Use of getchar () and 
IJjljiutchar (),” on page 124.) Constants such as 1 a 1 and ' + ' that we think of as charac- 
>t(- ters are of type i nt, not of type char. There are no constants of type char in C. This is 
1111 one of the few places where C++ differs from C. In C++, character constants are of type 
char. (See exercise 14, on page 142.) 

In addition to representing characters, a variable of type char can be used to hold 
|jj|Small integer values. Each char is stored in memory in 1 byte. Other than being large 
HJSenough to hold all the characters in the character set, the size of a byte is not specified 
|§|gih C. However, on most machines a byte is composed of 8 bits and is capable, therefore, 
Hof storing 2 s , or 256, distinct values. Only a subset of these values represents actual 
tf|. printing characters. These include the lower- and uppercase letters, digits, punctuation, 
ggWand special characters such as % and +. The character set also includes the white space 
^characters blank, tab, and newline. 

Most machines use either ASCII or EBCDIC character codes. In the discussion that fol- 
jjflows, we will be using the ASCII code. A table for this code appears in Appendix D, 
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“ASCII Character Codes.” For any other code, the numbers will be different, but the 
ideas are analogous. The following table illustrates the correspondence between some 
character and integer values on an ASCII machine: 


Some character constants and their corresponding integer values 



Observe that there is no particular relationship between the value of the character con 
slant representing a digit and the digit’s intrinsic integer value. That is, the value of ' 2 
is not 2. The property that the values for ' a', ' b', ' c', and so on occur in order is 
important. It makes convenient the sorting of characters, words, and lines into lexico 
graphical order. Character arrays are needed for this kind of work. (See Section 6.10, 
“Strings,” on page 270.) 

Some nonprinting and hard-to-print characters require an escape sequence. The hori 
zontal tab character, for example, is written as \t in character constants and in strings 
Even though it is being described by the two characters \ and t, it represents a single 
character. The backslash character \ is called the escape character and is used to 
escape the usual meaning of the character that follows it. The following table contains 
some nonprinting and hard-to-print characters: 
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backslash 


backspace 
carriage return 
double quote 
formfeed 
horizontal tab 
newline 
null character 
single quote 
vertical tab 
question mark 


The alert character \a is special; it causes the bell to ring. To hear the bell, try executing 
a program that contains the line 


printf("Xc", ' \a'); 


putchar('\a'); 


The double-quote character " has to be escaped if it is used as a character in a string. 


An example is 

W' 3 . 


printf("\"abc\""); 


/* "abc" is printed */ 


m 


Similarly, the single-quote character 1 has to be escaped if it is used in a constant char- 




SBHFTacter construct. 




mmm 




iH 


printf("%cabc%c" , ’\" , 


/* 'abc' is printed */ 


Up* 


mm - 
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Ifnside single quotes we can use either \" or ". 


printf("%cabc%c 


M TV fit fill 
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/* ’abc* is printed */ 
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Ifnside double quotes we can use either V or 1 
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printfC"\'abc’"); 


/* ' abc ' is printed */ 
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In ANSI C, the effect of escaping an ordinary character is undefined. Some compilers 
will complain about this; others will not. 

Another way to write a character constant is by means of a one-, two-, or three-octal¬ 
digit escape sequence, as in ' \007'. This is the alert character, or the audible bell. It 
can be written also as ' \07' or ' \7', but it cannot be written as ' 7'. ANSI C also pro¬ 
vides hexadecimal escape sequences. An example is ' \xla', which is control-z. 

Next, we want to understand how characters are treated as small integers, and, con¬ 
versely, how small integers are treated as characters. Consider the declaration 


char 


» * r , 
d. , 


The variable c can be printed either as a character or as an integer. 


printf("%c", c); 
printf("%d", c); 


/* a is printed */ 
/* 97 is printed */ 


Because c has an integer value, we may use it in arithmetic expressions, 
printf("%c%c%c" , c, c + 1, c + 2); /* abc is printed */ 


Actually, in this regard there is nothing special about the type char. Any integral 
expression can be printed either in the format of a character or an integer. 


char 

int 


for (i = 'a'; i <= 'z'; ++i) 
pri ntf("%c", i); 
for (c = 65; c <= 90; ++c) 
printf("%c", c); 
for (c = '0'; c <= '9'; ++c) 
printf("%d ", c); 


/* abc ... z is printed */ 

/* ABC ... Z is printed */ 

/* 48 49 ... 57 is printed */ 


Next, we want to look at how a char is stored in memory at the bit level, and how 
strings of bits are interpreted as binary numbers. Before we describe this, recall how 
strings of decimal digits are interpreted as decimal numbers. Consider, for example, 
the decimal number 20753. Its value is given by 


2 xlO 4 + Oxf 0 3 + 7 xlO 2 + SxlO 1 + 3x10° 


Observe that each of the exponents corresponds to the count ol a digit in the number 
20753, when we count from zero starting on the right. The digit 3 with count 0 is the 
least significant digit, and the digit 2 with count 4 is the most significant digit. I he gen¬ 
eral form of a decimal number is given by 
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3,5 T The Integral Types short, long, and unsigned 


117 
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3.4 The Data Type int 

The data type i nt is the principal working type of tlie c language. This type, along with 
the other integral types such as char, short, and long, is designed for working with 
the integer values that are representable on a machine. 

In mathematics, the natural numbers are 0,1, 2, 3.and these numbers, along with 

their negatives, comprise the integers. On a machine, only a finite portion of these inte¬ 
gers are representable for a given integral type. 

Typically, an i nt is stored in either 2 bytes ( = 16 bits) or in 4 bytes ( = 32 bits). There 
are other possibilities, but this is what happens in most C systems. On older PCs, an 
i nt is typically stored in 2 bytes. In newer PCs, and in workstations and mainframes, an 
i nt is typically stored in 4 bytes. 

Because the size of an i nt varies from one C system to another, the number of dis¬ 
tinct values that an int can hold is system-dependent. Suppose that we are on a com¬ 
puter that has 4-byte ints. Because 4 bytes equals 32 bits, an int can take on 2 32 
distinct states. Half of these states are used to represent negative integers and half are 
used to represent nonnegative integers: 


+ 1 


3, -2, -1, 0, 1, 2, 3 


If, on the other hand, we are using a computer that has 2-byte words, then an i nt can 
take on only 2 1(> distinct states. Again, half of these states are used to represent nega¬ 
tive integers, and half are used to represent nonnegative integers: 


+ 1 


1 , 0 , 1 


Let N miri iri | represent the smallest integer that can be stored in an i nt, and let N ma xj nt 
represent the largest integer that can be stored in an i nt. If i is a variable of type i nt, 
the range of values that i can take on is given by 


'mi n_i nt 


< i < N 


max_int 


with the end points of the range being machine-dependent. The typical situation is as 
follows: 

On machines with 4-byte words: 


‘mi n_int 
•max i nt 


= -2 31 = -2147483648 - -2 billion 

= +2 31 - 1 = +2147483647 « +2 billion 



yn machines with 2-byte words: 


t15 

mi n_int - ^ 1S 

max.i nt - 


= -32768 
1 = +32767 


-32 thousand 
+32 thousand 


>n any machine, the following code is syntactically correct: 


#define BIG 2000000000 


/* 2 billion */ 




int main(void) 
{ 

int a, b 
a = b + c; 


BIG, c = BIG; 

/* out of range? */ 


I * * * 
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. However, at run-time the variable a may he assigned an incorrect value. The logical 
Kliyalue of the expression b + c is 4 billion, which is greater than N max int . This condition 
HjSpis called an integer overflow. Typically, when an integer overflow occurs, the program 
discontinues to run, but with logically incorrect results. For this reason, the programmer 
JBjpust strive at all times to keep the values of integer expressions within the proper 
range. 

jjjis in addition to decimal integer constants, there are hexadecimal integer constants 
jjjf such as 0xla and octal integer constants such as 0377. (See Section 3.12, "Hexadecimal 
Jglgid Octal Constants,” on page 134.) Many C programmers have no particular need for 
1§§ hexadecimal and octal numbers, but all programmers have to know that integers that 
begin with a leading 0 are not decimal integers. For example, 11 and 011 do not have 
it-the same value. 
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3.5 The Integral Types short, long, and unsigned 


m— 
















In C, the data type i nt is considered the “natural” or “usual” type for working with inte¬ 
gers. The other integral types, such as char, short, and long, are intended for more 
specialized use. The data type short, for example, might be used in situations where 
storage is of concern. The compiler may provide less storage for a short than for an 
int, although it is not required to do so. In a similar fashion, the type long might be 
used in situations where large integer values are needed. The compiler may provide 
more storage for a long than for an int, although it is not required to do so. Typically, 
a short is stored in 2 bytes and a long is stored in 4 bytes. Thus, on machines with 4- 
byte words, the size of an int is the same as the size of a long, and on machines with 














11 8 Chapter 3 ▼ The Fundamental Data Types 

2-byte words, the size of an i nt is the same as the size of a short. If s is a variable ofc 
type short, then the range of values that s can take on is given by jj 

Nmin short - small < p 


where typically 


*mi rushort 


‘max. 


short ~ 


-32768 

+32767 


~32 thousand 
+32 thousand 


If bi g is a variable of type 1 ong, then the range of values that bi g can take on is given 
by f 

NminJlong ^ big < N max _] ong 




3.6 Y The Floating Types 


Combining long and unsigned 


Example 


Type 


unsigned 


long 


unsigned long 37UL 


where typically 


iiSI ' 




‘mi n_1 ong 
^max_long 


= +2 


= -2147483648 
1 = +2147483647 


- 2 billion 
+2 billion 



The Floating Types 


A variable of type unsigned is stored in the same number of bytes as an int. How¬ 
ever, as the name implies, the integer values stored have no sign. Typically, variables of g 
type int and unsigned are stored in a machine word. If u is a variable of type-jj 
unsi gned, then the range of values u can take on is given by |i 


0 < u < 2 


wordsize 


The typical situation is as follows: On machines with 4-byte words 


Wx_unsigned 


2 32 - 1 = +4294967295 - +4 billion 
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sfSI C provides the three floating types: float, double, and long double. Variables 
iflpif this type can hold real values such as 0.001, 2.0, and 3.14159. A suffix can be 
*Hippended to a floating constant to specify its type. Any unsuffixed floating constant is 
type double. Unlike other languages, the working floating type in C is double, not 
oat. 


Combining float and unsigned 


Suffix 


f or F 
1 or L 


Type 


float 


Example 


3.7F 


On machines with 2-byte words 


'max_unsigned 


1 = +65535 


+65 thousand 


fplk 

WM. 

§m 1 

mm 

WfeWm 


long double 3.7L 


Arithmetic on unsigned variables is performed modulo 2 wordsize . (See exercise 16, on 
page 143.) 

Suffixes can be appended to an integer constant to specify its type. The type of an 
unsuffixed integer constant is either int, long, or unsigned long. The system 
chooses the first of these types that can represent the value. For example, on machines 
with 2-byte words, the constant 32000 is of type i nt, but 33000 is oi type long. 



Integers are representable as floating constants, but they must be written with a deci¬ 
mal point. For example, the constants 1.0 and 2.0 are both of type doubl e, whereas 
the constant 3 is an i nt. 

In addition to the ordinary decimal notation for floating constants, there is an expo- 
jiential notation, as in the example 1.234567e5. This corresponds to the scientific 
IJnotation 1.234567 x 10 s . Recall that 

% 1.234567 x 105 = 1.234567 x 10 x 10 x 10 x 10 x 10 
| = 1.234567 x 100000 

| = 123456.7 (decimal point shifted five places') 

j§to a similar fashion, the number 1.234567e-3 calls for shifting the decimal point three 
Sfplaees to the left to obtain the equivalent constant 0.001234567. 
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Now we want to carefully describe the exponential notation. After we give the precise 
rules, we will show some examples. A floating constant such as 333.77777e~22 may 
not contain any embedded blanks or special characters. Each part of the constant is 

given a name: 

. . .. . ■■ M . Jinuum iTiTllTmTimillW III IIIIIHIll llll llWI IIlM 

i mum ii ii ii ■ 

Floating-point constant parts for 333.7777e-22 
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nteger 


333 


Fraction 




Exponent 


e-22 


[.suffix 


Some examples of floating constants are 


3.14159 

314.159e-2F 

0e0 


/* of type float */ 

/* equivalent to 0.0 */ 

/* equivalent to 1.0, but harder to read */ 


but not 


3.14,159 
314159 
. e0 

-3.14159 


/* comma not allowed */ 

/* decimal point or exponent part needed */ 
/* integer or fractional part needed */ 

/* this is floating constant expression */ 


Mm A 

p7'. :v;. ■?&* 


: 




A floating constant may contain an integer part, a decimal point, a fractional part, and 
an exponential part. A floating constant must contain either a decimal point or an expo¬ 
nential part or both. If a decimal point is present, either an integer part or fractional 
part or both must be present. If no decimal point is present, then there must be an inte¬ 
ger part along with an exponential part. 

floating ..constant : := [.constant {[.suffix} opt 
[.constant : := i.part . f.part e.part 

i.part . f-part 
L part 

| . [.part 

. f.part e.part 

Lpart e.part 

i.part ::= inte.ger.parl ::= {digit} 1+ 
f.part ::= fractionaLpart : := {digit} 1+ 

e. part ::= exponentiai.part ::= {e|E} ± {+|-} opt {digit} i + 

f. suffix : : = floating.suffix :: = f | F | 1 I L 
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Typically, a C compiler will provide more storage for a variable of type double than 
for one of type f 1 oat, although it is not required to do so. On most machines, a f1 oat 
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stored in 4 bytes, and a doubl e is stored in 8 bytes. The effect of this is that a float 
{tores about 6 decimal places of accuracy, and a doubl e stores about 15 decimal places 
If accuracy. An ANSI C compiler may provide more storage for a variable of type 
long doubl e than for one of type doubl e, though it is not required to do so. Many 
llmpilers implement a 1 ong doubl e as a doubl e. (See exercise 15, on page 143.) 

The possible values that a floating type can he assigned are described in terms of 
attributes called precision and range. The precision describes the number of significant 
decimal places that a floating value carries. The range describes the limits of the largest 
and smallest positive floating values that can be represented in a variable of that type. 
A float on many machines has an approximate precision of 6 significant figures and 
an approximate range of 10~ 38 to 10 +3S . This means that a positive float value is rep¬ 
resented in the machine in the form (only approximately true) 

| 0.d 1 d 2 d 3 d,,d 5 d 6 x 10 n 

where each d ; is a decimal digit; the first digit, d,, is positive and -38 x n x +38. The 
representation of a float value in a machine is actually in base 2, not base 10, but the 
ideas as presented give the correct flavor. 

A double on many machines has an approximate precision of 15 significant figures 
and approximate range of 10“ 308 to 10 +308 . This means that a positive double value is 
represented in the machine in the form (only approximately true) 


0. did- 

J. i 


• dii; X 10 1 


where each dj is a decimal digit, the first digit, dj, is positive; and -308 < n < +308 
Suppose x is a variable of type doubl e. Then the statement 


123.45123451234512345; 


/* 20 significant digits */ 


will result in x being assigned a value that is stored in the form (only approximately 
true) 


0.123451234512345 x 10' 


(15 significant digits') 


The main points you must be aware of are (1) not all real numbers are representable, 
and (2) floating arithmetic operations, unlike the integer arithmetic operations, need 
not be exact. For small computations this is usually of no concern. For very large com¬ 
putations, such as numerically solving a large system of ordinary differential equations, 
| a good understanding of rounding effects, scaling, and so on may be necessary. This is 
jp the domain of numerical analysis. 
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3.7 The Use of typedef 










The C language provides the typedef mechanism, which allows the programmer to 
explicitly associate a type with an identifier. Some examples are 


typedef 

typedef 

typedef 


char 

int 

unsigned long 


uppercase; 
INCHES, FEET 
sizet; 






/* found in stddef.h *Jjj 


In each of these type definitions, the named identifiers can be used later to declare vaijj 
ables or functions in the same way ordinary types can be used. Thus, f 


uppercase 

INCHES 


u 


length, width; 


declares the variable u to be of type uppercase, which is synonymous with the type 
char, and it declares the variables length and width to be of type INCHES, which is 
synonymous with the type i nt. 

What is gained by allowing the programmer to create a new nomenclature for an 
existing type? One gain is in abbreviating long declarations. Another is having type; 
names that reflect the intended use. Furthermore, if there are system-sensitive declares 
tions, such as an i nt that is 4 bytes on one system and 2 bytes on another, and these; 
differences are critical to the program, then the use of typedef may make the porting- 
of the software easier. In later chapters, after we introduce enumeration types andj 
structure types, we will see that the typedef facility gets used routinely. 


m 






3.8 The sizeof Operator 


''.vonSfifr 


C provides the unary operator si zeof to find the number of bytes needed to store aitS 
object. It has the same precedence and associativity as all the other unary operators. An | 
expression of the form 


■■ - :c d#rm 
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mpm* hrteger that represents the nurnber of bytes need-eel to store the object in 
ory. An object can be a type such as i nt or f 1 oat, or it can be an expression such 
|g|a + b, or it can be an array or structure type. The following program uses tills oner- 
fcr. On a given machine it provides precise information about the storage require- 
llinents for the fundamental types. 




BP* Compute the size of some fundamental types. */ 
#include <stdio.h> 


mm 

sSii-.- 


fflS-int main (void) 

{ 

printf(The size of some fundamental types is computed.\n\n") 




Ifife 


y : -v> 

fe-- ■; 
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printf (” 
printf ( M 
printf ( ,r 
pri ntf (" 
printf(” 
printf( n 
printf(" 


} 


char:%3u byte \n", sizeof(char)); 
short:%3u bytes\n", sizeof(short)); 
int.*%3u bytes\n", sizeof(int)); 
long:%3u bytes\n", sizeof(long)); 
unsigned:%3u bytes\n", sizeof(unsigned)); 
float:%3u bytes\n", sizeof(float)); 
double:%3u bytes\n", sizeof(double))• 

printf("long double:%3u bytes\n", sizeof(long double))- 
return 0; ' ’ 


^^fcause the C language is flexible in its storage requirements for the fundamental 
the situation can vary from one machine to another. However, it is guaranteed 


TV C 




Wm f 




sizeof(char) = 1 

sizeof(char) <= sizeof(short) < sizeof(int) < sizeof(long) 
sizeof(signed) = sizeof(unsigned) = sizeof(int) 
sizeof(float) < sizeof(double) < sizeof(long double) 


j|S| the signed and unsigned versions of each of the integral types are guaranteed to 
Jjghave the same size. 

||t| Notice that we wrote si zeof (...) as if it were a function. It is not, however—it is an 
^operator. If sizeof is being applied to a type, then parentheses are required; otherwise 
gpey are optional. The type returned by the operator is typically unsigned 
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sizeof (object) 
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3.9 The Use of getcharO and putcharO 

In this section, we illustrate the use of getcharO and putcharO. These are macros 
defined in stdio.h that are used to read characters from the keyboaid and to print char¬ 
acters on the screen, respectively. Although there are technical differences, a macro is. 
used as a function is used. (See Section 8.6, “.An Example: Macros with Arguments, on 
page 377.) These macros, as well as others, are often used when manipulating character 

data. 

In memory, a char is stored in 1 byte, and an i nt is stored typically in either 2 or 4 
bytes. Because of this, an i nt can hold all the values that can be stored in a char, and 
more. We can think of a char as a small integer type, and, conversely, we can think of 
an i nt as a large character type. This is a fundamental idea, and, unfortunately, a diffi¬ 
cult one for beginning C programmers. 

Our next program is called double_out. It reads characters one after another from the 
standard input file, which is normally connected to the keyboard, and writes each char¬ 
acter twice to the standard output file, which is normally connected to the screen. 

In file double.out.c 

#include <stdio.h> 

int main(void) 

{ 

int c; 

while ((c = getcharO) ■= EOF) { 
putchar(c); 
putchar(c); 

} 

return 0; 

} 





3.9 v The Use of getcharO and putcharO 125 




#define EOF (-1) 


Jjjjlhe identifier EOF is mnemonic for “end-of-file.” What is actually used to signal an end- 
llllif'file mark is system-dependent. Although the i nt value -1 is often used, different 
IJljltems can have different values. By including the file stdio.h and using the symbolic 
jflponstant EOF, we have made the program portable. This means that the source file can 
■jpljlprioved to a different system and run with no changes. The header file stdio.h also 
Jjfljlntains the macro definitions of getcharO and putcharO. 

v/ 

^j|i int c; 

The variable c has been declared as an i nt rather than a char. Whatever is used to sig- 
ffejjjf the end of a file, it cannot be a value that represents a character. Because c is an 
Jjftnf:, h can hold all possible character values as well as the special value EOF. 

while C(c = getcharO) != EOF) { 


expression 


(c = getcharO) != EOF 


Jjls(composed of two parts. The subexpression c = getcharO gets a value from the 
ijj§keyboard and assigns it to the variable c, and the value of the subexpression takes on 
jjjjhat value as well. The symbols != represent the “not equal” operator. As long as the 
of the subexpression c = getcharO is not equal to EOF, the body of the while 
, t*T-5§op is executed. The parentheses around the subexpression c = getcharO are neces- 
Suppose we had left out the parentheses and had typed 

~ *' '' 

— ___I_ \ \ r~ r~ 




c = getcharO != EOF 


a&mm a; 


j pjjjfc -Because of operator precedence this is equivalent to 




jppfgc = (getcharO ! 


EOF) 


‘SJcWhich is syntactically correct, but not what we want. 


putchar(c); 


||ffiie value of c is written to the standard output stream in the format of a character. 
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Characters have an underlying integer-valued representation that on most C systems^ 
is the numeric value of their ASCII representation. For example, the character constant | 
' a' has the value 97. The values of both the lower- and uppercase letters occur m j 

order. Because of this, the expression 'a' + 1 has the value , e e *Pr 

- b ' + 1 has the value 1 c', and so on. Also, because there are 26 letters m the alpha- | 
bet, the expression ' z' - ' a' has the value 25. Consider the expression A a • t^ 

has a value that is the same as ' B' - ' b', which is the same as ’ 

Because of this, if the variable c has the value of a lowercase letter, then the expression 
c + ' A' - ' a' has the value of the corresponding uppercase letter. I hese ideas are 
incorporated into the next program, which capitalizes all lowercase letters. 

In file capitalize.c 

#include <stdio.h> 

int main(void) 

{ 

int c; 

while ((c = getcharO) != EOF) 
if (c >= 'a' && c <= 'z') 
putchar(c + 'A' - 'a'); 
el se 

putchar(c); 
return 0; 

} 

Because of operator precedence, the expressions 


c >= 'a' && c <= 'z' and (c >= ’a’)'*’ (c <- 'z ) 


are equivalent. The symbols <= represent the operator “less than or equal.” The subex¬ 
pression c >= 'a' tests to see if the value c is greater than or equal to the value of 


The subexpression c <= ' z' tests to see if the value of c is less than or equal to the 
value ’ z'. The symbols && represent the operator “logical and.” If both subexpressions 


are true, then the expression 


c >= 'a' && c <= 'z' 

is true; otherwise it is false. Thus, the expression is true if and only d c is a lowercase 
letter. If the expression is true, then the statement 


putchar(c + 'A' - 'a'); 

is executed, causing the corresponding uppercase letter to be printed. 
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3,10 Mathematical Functions 

There are no built-in mathematical functions in C. Functions such as 


- 


sqrt() pow() exp() log() sin() cos() 


tan() 


lire available in the mathematics library, which is conceptually part of the standard 
PSllibrary. All of these functions, except the power function pow(), take a single argument 
gjgjftype double and return a value of type double. The power function takes two argu- 
B^hents of type doubl e and returns a value of type doubl e. Our next program illustrates 
Ethe use of sqrt() and pow(). It asks the user to input a value for x and then prints it 
Hmt, along with the square root of x and the value of x raised to the x power. 
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file power_square,c 

#include <math.h> 

| #iDelude <stdio.h> 

int main(void) 


TTi TT ■; v"‘ )'•''' 
yi .*-\ ,< y 


double x; 
printf("\n%s", 

"The following will be computed:\n" 
"\n" 

" the square root of x\n" 

" x raised to the power x\n" 

, "\n"); 


while ( 1 ) { 

printf("Input x: "); 
if (scanf("%lf", &x) != 1 ) 
break; 

if (x >= 0 . 0 ) 

printf ("\n%15s%22.15e\n%15s%22.15e\n%15s%22.15e\n\n", 

*' V — 11 V 

A — f A f 

"sqrt(x) = ", sqrt(x), 

"pow(x, x) = ", pow(x, x)); 

else { 

printf("\nSorry, your number must be nonnegative.\n"); 


X 
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printf("\nBye!\n\n"); 
return 0; 
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If we execute the program and enter 2 when prompted, here is what appears on the 


screen: 



The following will be computed 

the square root of x 
x raised to the power x 


Input x: 2 

x 

sqrt(x) 
pow(x, x) 

Input x: 
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2 .0000000000000006+00 
1.414213562373095e+00 

4.000000000000000e+00 





ppi 

pm 

mm m-mf 


€V9m 

lit! SS5? 


Eli 

IHlilBI 


JSSSi 


Dissection of the sqrt^pow Program 

■ #include <math.h> 

#include <stdio.h> j 

These header files contain function prototypes. In particular, math.h contains the pro* 
totypes for the functions in the mathematics library. Although it is i 

to do so, as an alternative to including math.h, we can supply our own f P 

types: 

s double sqrt(double), pow(double, double), 

This declaration should be placed in the file just above mainQ. (Some compilers will 
complain if the function prototype is placed in the body ol man n () itself.) 

■ while (1) { 

• * * * • 

Because any nonzero value is considered to be true, the expression 1 creates an infinite 
whi 1 e loop. We will use a break statement to exit the loop. 
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scanf("%lf”, &x) 

The format %lf is used in the control string because x is a doubl e. A common error is 
'to use %f instead of %1 f. Notice that we typed 2 when we illustrated the use of this pro¬ 
gram. Equivalently, we could have typed 2.0 or 2e0 or 0.2el. The function call 
scanf ("%I f" > &x) would have converted each of these to the same double. In C 
source code, 2 and 2.0 are different. The first is of type i nt, and the second is of type 
double. The input stream that is read by scanf () is not source code, so the rules for 
source code do not apply. When scanf () reads in a double, the number 2 is just as 
»ood as the number 2.0. (See Section 11.2, “The Input Function scanf (),” on page 
99.) 

i if (scanf("%lf", &x) != 1) 
break; 

Time function scanf () returns the number of successful conversions. To exit the whi 1 e 
[loop we can type “quit” or anything else that scanf () cannot convert to a double. If 
the value returned by scanf O is not 1, the break statement gets executed. This causes 
'prog ram control to exit the while statement. We discuss the break statement more 
fully in Chapter 4, “Flow of Control.” 

J if (x >= 0.0) 


mM si >r' 







Ilf "Because the square root function is defined only for nonnegative numbers, a test is 
made to ensure that the value of x is nonnegative. A call such as sqrt(-1.0) can cause 
a run-time error. (See exercise 20, on page 144.) 

I printf("\n%15s%22.15e\n%15s%22.15e\n%15s%22.15e\n\n", 

"x = ", x, 

I, "sqrt(x) = ", sqrt(x), 

Si "pow(x, x) = ", pow(x, x)); 

% Notice that we are printing doubl e values in the format %22.15e. This results in 1 place 

P&P-'f'i:'.; "V 

p) to the left of the decimal point and 15 places to the right, 16 significant places in all. On 
8|gpur machine, only n places are valid, where n is between 15 and 16. (The uncertainty 
comes about because of the translation from binary to decimal.) You can ask for lots of 
®Pdecimal places to be printed, but you should not believe all that you read. 

















1 32 Chapter 3 ▼ The Fundamental Data Types 


gp 7 . 




mmmm 
■ ■ ■ 

iillSiBl 


Is ; ' BH | 


3.11 ▼ Conversions and Casts 


The usual arithmetic conversions: 


If either operand is of type 1 ong doubl e, the other operand is converted to 1 ong double. 

Otherwise, if either operand is of type doubl e, the other operand is converted to doubl el 

Otherwise, if either operand is of type float, the other operand is converted to float. j 

Otherwise, the integral promotions are performed on both operands, and the following 
rules are applied: 

If either operand is of type unsigned long, the other operand is converted to 
unsigned long. | 

Otherwise, if one operand has type 1 ong and the other has type unsigned, then on§j 
of two possibilities occurs: § 

If a long can represent all the values of an unsigned, then the operand of 
type unsigned is converted to long. jf 

If a long cannot represent all the values of an unsigned, then both of the 
operands are converted to unsigned long. J 

Otherwise, if either operand has type long, the other operand is converted to long.; 

Otherwise, if either operand has type unsigned, the other is converted to 
unsigned. 

Otherwise, both operands have type i nt. J 

This arithmetic conversion has several alternate names: |j 

b automatic conversion JJ 

■ implicit conversion 1 

a coercion S 

■ promotion f 

a widening 


iiH - v m 
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following table illustrates the idea of automatic conversion: 


Declarations 
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char c; 

short s; 

int i; 


long 1; 

unsigned u; 

unsigned long 

ul; 

float f; 

double d; 

long double 

Id; 






c - s / i 


u * 2.0 - i 


c + 3 


c + 5.0 


d + s 


2 * i / 1 


Type 


int 


double 


int 


double 


double 


long 


Expression 



- i 


•p "k y _ -j 


7 * s * ul 


Id + c 


u - ul 


u - 1 


unsigned 


float 


unsigned long 


long double 


unsigned long 


syste 



-M 


In addition to automatic conversions in mixed expressions, an automatic conversion 
Ifgfcalso can occur across an assignment. For example, 




; d = i 

causes the value of i, which is an i nt, to be converted to a doubl e and then assigned to 
d, and double is the type of the expression as a whole. A promotion or widening such 
fas d = i will usually be well behaved, but a narrowing or demotion such as i = d can 
lose information. Here, the fractional part of d will be discarded. If the remaining inte- 
fger part does not fit into an i nt, then what happens is system-dependent. 


(F* _ _ _ 

Ldbl5 


ISM! 

v; 


In addition to implicit conversions, which can occur across assignments and in mixed 
1® expressions, there are explicit conversions called casts. If i is an i nt, then 




(double) i 


§gjjr will cast, or convert, the value of i so that the expression has type doubl e. The variable 
f i itself remains unchanged. Casts can be applied to expressions. Some examples are 
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(long) ('A' + 1.0) 
f = (float) ((int) d + 1) 
d = (double) i / 3 
(double) (x = 77) 

but not 





88888 *' 
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= A x 16 4 + 0 x 16 3 + F x 16 2 + 3 x 16 1 + C x 16® 

= 10 x 16 4 + 0 x 16 3 + 15 x 16 2 + 3 x 16 1 + 12 x 16® 
= 659260 


(double) x = 77 /* equivalent to ((double) x) = 77, error */ 

The cast operator (type) is a unary operator having the same precedence and right-to 
left associativity as other unary operators. Thus, the expression 



hexadecimal numbers and their decimal equivalents are given in the following 
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Hexadecimal number 


(float) i + 3 


is equivalent to 


((float) i) + 3 


113 


Conversion to decimal 


2 x 16 + A = 2 x 16 + 10 = 42 
B x 16 + 3 = 11 x 16 + 3 = 179 
1 x 16 2 + 1 x 16 + 3 = 275 


because the cast operator (type) has higher precedence than +. 
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3.12 Hexadecimal and Octal Constants 

A number represented by a positional notation in base 16 is called a hexadecimal num¬ 
ber. There are 16 hexadecimal digits. 

Hexadecimal digits and their corresponding decimal values 


Wmm 




T On machines that have 8-bit bytes, a byte is conveniently represented as two hexa¬ 
decimal digits. Moreover, the representation has two simultaneously valid interpreta¬ 
tions. First, one may consider the 8 bits in a byte as representing a number in base 2 
notation. That number can be expressed uniquely as a hexadecimal number with two 
hexadecimal digits. The following table lists 8-bit bytes and corresponding two-digit 
hexadecimal numbers. For convenience, decimal numbers are listed, and for later refer¬ 
ence octal numbers are also listed. 


MM 


m 


Hexadecimal digit: 
Decimal value: 


. y 


. 9 




jpfpf 




12 13 14 15 



Decima 




A positive integer written in hexadecimal notation is a string of hexadecimal digits of 
the form 


mg 


iSiS 



hv >]i 


n H n -1 4 * 




§pp 




where each hf is a hexadecimal digit. It has the value 

h„xl6” + h„j 1 xl6' 1_1 + ... + h 2 xl6 2 + hjxlG 1 + h 0 xl6 ( 





188 


254 

ito M IX X. 

255 


Binary 


00000000 

00000001 

00000010 

00000011 


00011111 

00100000 


10111100 


11111110 

11111111 


Hexadecimal 


Octa 


000 

001 

002 

003 


037 

040 


274 


376 

377 


For example, 


Another interpretation of this correspondence is also useful. By definition, a nibble 
consists of 4 bits, so a byte is made up of 2 nibbles. Each nibble has a unique represen- 
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tation as a single hexadecimal digit, and 2 nibbles, making up a byte, are representabl 
as 2 hexadecimal digits. For example, 


1011 1100 


corresponds to 


Note that this same correspondence occurs in the table. All of this is use u w en 

manipulating the values of variables in bit form. 

The octal digits are 0, 1, 2, ... , 7. A positive integer written in octal notation is a 
string of digits of the form 

Oyi Ofi-n * * * ^2 °o 

where each o ; is an octal digit. It has the value 

o n x8 n + o fl _! x8 n_1 + .. . + o 2 x8 2 + 0ix8 3 + 0o x8 ° 


For example, : ';m® 1 

75301 = 7x 8 4 + 5x 8 3 + 3x 8 2 + 0x 8 1 + 1x8 

On machines that have words consisting of 24 or 48 bits it is natural to have words* 

consisting of “bytes” with 6 bits, each “byte made upo m es o , „ ' 

this case a “nibble” has a unique representation as a single octal digit, a > 

a unique representation as two octal digits. 'Alii 

In C source code, positive integer constants prefaced with 0 ^present integers 1 |g 

octal notation, and positive integer constants prefaced with 0x or 0 ^^ esen ^^ Ml 
in hexadecimal notation. Just as with decimal integer constants octal and hexad Jg 

constants may have suffixes appended to specify the type. The letters A through F and .. |J 

a through f are used to code hexadecimal digits. 

hexadecimalJntegerconstant ::= hjnteger constant {J-fuff lx >opt 'M 
h_integer^constant ::= { 0x | 0X h { hexadecimal-digit } 1+ 

hexadecimaLdigit ::= 0 I 1 I I 9 j a 1 ^ 1 ;'' 4fS 

octaL integer constant :: = o.integer.constant 1 i.sufi j opt ; ||B 

oJntegerlconstant : := 0 { octaLdigit >i+ ;|| 

octaLdigit :: = 0 i 1 I ■ ■ ■ I 7 AiSp 

Lsuffix ::= integer.suffix :: = { u I U } opt i 1 I L ->opt m 

Let us write a program to illustrate these ideas. We will show the output of each A 
pri ntf () statement as a comment. . J| 
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/* Decimal, hexadecimal, octal conversions. */ 
#include <stdio.h> 
int main(void) 

printf("%d %x %o\n", 19, 19, 19); 
printf("%d %x %o\n", 0xlc, 0xlc, 0xlc); , 

printf("Xd %x %o\n", 017, 017, 017); 
printf("%d\n", 11 + 0x11 + 011); 
printf("%x\n", 2097151); 
printf("%d\n", OxlFfFFf); 
return 0; 


/* 19 13 

/* 28 lc 
/* 15 f 
/* 37 
/* lfffff 
/* 2097151 


23 */ 
34 */ 


On machines with 2-byte i nts, the last tw'o formats must be changed to %1 x and %1 d, 
respectively. The functions pri ntf () and scanf() use the conversion characters and 
d, x, and o in conversion specifications for decimal, hexadecimal, and octal, respec¬ 
tively. With pri ntf (), formats of the form %x and %o cause integers to be printed out 
in hexadecimal and octal notation, but not prefaced with 0x or 0. The formats %#x and 
%#o can be used to get the prefixes. (See Section 11.1, “The Output Function pri ntf ()," 
on page 493, for further discussion.) Caution: When using scanf () to read in a hexa¬ 
decimal number, do not type an 0x prefix. 


C.1B Summary 

| 1 The fundamental data types are char, short, int, long, unsigned versions of 
these, and three floating types. The type char is a 1-byte integral type mostly used 
for representing characters, 

|? 2 The type int is designed to be the “natural” or “working” integral type. The other 
integral types such as short, long, and unsigned are provided for more special¬ 
ized situations. 


3 Three floating types, float, double, and long double, are provided to represent 
real numbers. Typically, a f 1 oat is stored in 4 bytes and a doubl e in 8 bytes. The 
number of bytes used to store a long doubl e varies from one compiler to another. 
However, as compilers get updated, the trend is to store a 1 ong doubl e in 16 bytes. 
The type double, not float, is the “working” type. 
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4 Unlike integer arithmetic, floating arithmetic is not always exact. Engineers and 
numerical analysts often have to take roundoff effects into account when doing 
extensive calculations with floating-point numbers. 

5 The unary operator si zeof can be used to find the number of bytes needed to store 
a type or the value of an expression. For example, si zeof (i nt) is 2 on some older 
small machines and is 4 on most new machines that have 32-bit words. 

6 The usual mathematical functions, such as sinQ, cos(), and tan(), are available! 

in the mathematics library. Most of the functions in the library take a single argu¬ 
ment of type double and return a value of type double. The standard header file! 
math.h should be included when using these functions. 5 

7 Automatic conversions occur in mixed expressions and across an equal sign. Casts 
can be used to force explicit conversions. 

8 Integer constants beginning with 0x and 0 designate hexadecimal and octal inte¬ 
gers, respectively. ( 

9 Suffixes can be used to explicitly specify the type of a constant. For example, 3U is 

of type unsigned, and 7.0F is of type float. <3 

10 A character constant such as 'A' is of type i nt in C, but it is of type char in C++. 
This is one of the few places where C++ differs from C. 


Fvprrkp^ 

1 Not all real numbers are machine-representable; there are too many of them. Thus, 
the numbers that are available on a machine have a “graininess” to them. As an 
example of this, the code 

double x = 123.45123451234512345; 

double y = 123.45123451234512300; /* last two digits zero */ 

printf("%.17f\n%.17f\n", x, y); 


causes two identical numbers to be printed. How many zeros must the initializer 
for y end with to get different numbers printed? Explain your answer. 


'■ -v". 
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HH Xhe mathematical formula 


|| sin 2 (x) + cos 2 (x) = 1 

holds for all x real. Does this formula hold on your machine? Try the following pro- 
gram: 

||!y #include <math.h> 

K finclude <stdio.h> 

int main(void) 

1 { 

g, double two_pi = 2.0 * M_PI; /* in math.h */ 

double h =0.1; /* step size */ 

double x; 


/* in math.h */ 

/* cfpn ciyp k / 


for (x = 0.0; x < two_pi; x += h) 
pri ntf ("965. If: %.15e\n", 

x, sin(x) * sin(x) + cos(x) * cos(x)); 
return 0; 


S What happens if the format %. 15e is changed to %. 15f? Explain. 

■ 

|3 Write a program that prints a table of trigonometric values for sin(), cos(), and 
, tan(). The angles in your table should go from 0 to 2n in 20 steps. 

§4 Write a test program to find out whether the pri ntf () function truncates or 
| rounds when writing a f 1 oat or doubl e with a fractional part. The ANSI standard 
| requires rounding, but some systems do not do this. What happens on your 
fi- machine? 

5 Use the following code to print out a list of powers of 2 in decimal, hexadecimal, 
and octal: 


int 


i, val = 1 


for (i = 1; i < 35; ++i) { 
printf("%15d%15u%15x%15o\n 
val *= 2; 


val, val, val, val); 


What gets printed? Explain. Powers of 2 have a special property when written in 
hexadecimal and octal notation. What is the property? 
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6 What happens when the arithmetic overflows? Try the following code on your sys¬ 
tem: 

int big_big = 2000000000 + 2000000000; 
printf("%d %u\n", big_big, big_big); 

If you are working on a machine with a 2-byte int, change 2000000000 to 32000. 

What gets printed? Explain. Does the output change if big_big is declared to be 
unsigned instead of i nt? 

7 Study the following code carefully wi thout running it: 

printf("Why is 21 + 31 equal to %d?\n", 21 + 31); 

On a machine with 4-byte i nts, here is what gets printed: 

Why is 21 + 31 equal to 5? 

Can you see why? Can you deduce the moral? 

8 In mathematics the Greek letter e, called “epsilon,” is often used to represent a 
small positive number. Although it can be arbitrarily small in mathematics, on a 
machine there is no such concept as “arbitrarily small.” In numerical analysis it is 
convenient sometimes to declare eps (for “epsilon”) as a variable of type double 
and to assign to eps the smallest positive number with the property that 

1.0 < 1.0 + eps 

is true. This number is machine-dependent. See if you can find eps on your 

machine. Begin by assigning the value le-37 to eps. You will find that for this value 
the expression is false. 

9 If you expand the two functions tan(sin(x)) and sin(tan(,\)) in a Taylor series 
about the origin, the two expansions agree for the first seven terms. (If you have 
access to a computer algebra system such as Maple or Mathematica, you can see 
this easily.) This means that the difference of the two functions is very flat at the 
origin. Try the following program: 

#include <math.h> 

#include <stdio.h> 

double f(double x); 



int main(void) 

{ 

double x; 

for (x = -0.25; x <= +0.25; x += 0.01) 
printf("f(%+.2f) =%+.15f\n", x, f(x)); 
return 0; 

> 

double f(double x) 

{ 

return (tan(sin(x)) - sin(tan(x))); 

} 

The output of this program illustrates the flatness of f () near the origin. Do you 
see it? Also, as an experiment remove the plus signs that occur in the formats. You 
will find that the output does not line up properly. 


Most machines use the two’s complement representation to store integers. On these 
machines, the value -1 stored in an integral type turns all bits on. Assuming that 
your system does this, here is one way to determine whether a char is equivalent to 
a si gned char or to an unsi gned char. Write a program that contains the lines 


char c = -1; 
signed char s = -1; 
unsigned char u = -1; 


printf("c = %d 


s = %d u = %d\n", c, s, u); 


Each of the variables c, s, and u is stored in memory with the bit pattern 11111111. 
What gets printed on your system? Can you tell from this what a char is equivalent 
to? Does your ANSI C compiler provide an option to change a plain char to, say, an 
unsi gned char? If so, invoke the option, recompile your program, and run it again. 


Explain why the following code prints the largest integral value on your system: 
unsigned long val = -1; 

printf("The biggest integer value: %lu\n", val); 

Wdiat gets printed on your system? Although technically the value that gets printed 
is system-dependent, on most systems the value is approximately 4 billion. Explain. 













Wife,. 

jllfp 

mg'iHg 4 'ZZMMW&sM 

■ 

'! : '. ’Y : r' Sj ’rr -~!l " 11 ;T "A ' * 

* k; *zs 



1 42 Chapter 3 Y The Fundamental Data Types 


12 A variable of type char can be used to store small integer values. What happens if 
large value is assigned to a char variable? Consider the code 




char cl = 256, c2 = 257; 
printf("cl = %d\nc2 = %d\n", cl, c2); 


/•■ too big! */ 





Your compiler should complain. Docs it? Even if it does complain, it will probably, 
produce executable code. Can you guess what gets printed? * 

13 The following table shows how many bytes are required on most machines to store 
some of the fundamental types. What are the appropriate values for your machine? 
Write and execute a program that allows you to complete the table. 



SI® 




Wmm 


pplkliy'fe... 





Memory required 
on machines with 
4-byte words 


1 byte 


2 bytes 


4 bytes 


4 bytes 


4 bytes 


4 bytes 


8 bytes 


Memory required 
on machines with 
2-byte words 


1 byte 


2 bytes 


2 bytes 


2 bytes 


4 bytes 


4 bytes 


8 bytes 


Memory required 
on your machine 
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Fundamental type 


char 


short 


int 


unsigned 


long 


float 


long double 


14 The type of a character constant such as f A f is different in C and C++. In C, its typeH 
is int, but in C++, its type is char. Compile the following program first in C and|§ 
then in C++. In each case, what gets printed? f 


#include <stdio.h> 

int main(void) 

{ 

printf("sizeof('A') = %u\n", sizeof('A')); 
return 0; 
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Consider the following code: 
char c = 'A'; 

printf("sizeof(c) = %u\n”, sizeof(c)); 

printf("sizeof('A') =%u\n M , sizeof('A')); 

printf("sizeof(c + c) = %u\n", sizeoffc + c)); 

printf("sizeof(c = 'A') = %u\n", sizeoffc = 'A')); 

Write down what you think gets printed; then write a small program to check your 

answer. How many lines of the output change if you compile the program in C++? 

* 

Let NniimuJong anc l ^max_u.long represent the minimum and maximum values that 
can be stored in an unsigned long on your system. What are those values? Hint: 
Read the standard header file limits.h. 

On a 24-hour clock, the zero hour is midnight, and the 23rd hour is 11 o’clock at 
night, one hour before midnight. On such a clock, when 1 is added to 23, we do not 
get 24, but instead we get 0. There is no 24. In a similar fashion, 22 plus 5 yields 3, 
because 22 plus 2 is 0 and 3 more is 3. This is an example of modular arithmetic, or 
more precisely, of arithmetic modulo 24. Most machines do modular arithmetic on 
all the integral types. This is most easily illustrated with the unsigned types. Run 
the following program and explain what gets printed: 

#include <limits.h> /* for UINT_MAX */ 

#include <stdio.h> 

int main(void) 

{ 

int i; 

unsigned u = UINT_MAX; /* typically 4294967295 or 65535 */ 

for (i =0; i <10; ++i) 

printf("%u + %d = %u\n", u, i, u + i); 
for (i =0; i <10; ++i) 

printf("%u * %d = %u\n", u, i, u * i); 
return 0; 

1 

The ANSI C standard suggests that the recommendations of IEEE Standard for 
Binary Floating-Point Arithmetic (ANSI/IEEE Std 754-1985) be followed. Does your 
compiler follow these recommendations? Well, one test is to try to see what hap¬ 
pens when you assign a value to a floating variable that is out of its range. Write a 
small program containing the lines 
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double 

double 


x = le+307; 
y = x * x; 


/* big */ 

/* too big! */ 


printf("x = %e y = %e\n", x, y); 


Does the value for y get printed as Inf of Inf i ni ty? If so, there is a good chance 
that your compiler is following ANSI/IEEE Std 754-1985. 

19 In mathematics, the numbers e and n are well known. The number e is the base of 
the natural logarithms and the number % is the ratio of the diameter of a circle to its 
circumference. Which of the two numbers, e K and n e , is larger? This is a standard 
problem for students in an honors calculus course. However, even if you have never 
heard of e and n and know nothing about calculus, you should be able to answer 
this question. Hint: 


2.71828182845904524 


and 


3.14159265358979324 


20 What happens when the argument to the sqrt() function is negative? In some com¬ 
pilers a call such as sqrt(-1.0) causes a run-time error to occur, whereas in other 
compilers the special value NaN gets returned. The value NaN is called “not a num¬ 
ber." What happens on your system? To find out, write a test program that contains 
the line 

printf("sqrt(-1.0) =%f\n", sqrt(-1.0)); 

21 If the value of x is too large, the call pow(x, x) may cause a run-time error or may 
cause the word Inf or Inf i ni ty to appear on the screen when a pri ntf () state¬ 
ment is used to print the value produced by powfx, x). What is the largest integer 
value for x such that the statement 

printf("pow(%.lf, %.lf) = %.7e\n", x, x, pow(x, x)); 

does not cause a run-time error and does not cause Inf or Inf i ni ty to be printed? 
Hint: Put the statement in a for loop. 

22 In traditional C, any float was automatically promoted to a double. In ANSI C, a 
compiler can promote a float to a double in arithmetic expressions, but is not 
required to do so. We find that most ANSI C compilers perform arithmetic on 
f 1 oats without any widening. What happens on your compiler? Try the following 
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rt into in file that will not get copied to 
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26 In C, the letters used in a hexadecimal constant can be either upper- or lowercase, 
or both. Consider the following code: 

int a = 0xabc; 

int b = 0xABc; j 

int c = 0XABC; 

printf("a = %d b = %d c ~ %d\n", a, b, c); 

Write down what you think gets printed. Then write a program containing these 
lines to check your answer. 

27 The following code is machine-dependent. If when you execute the code it does the 
unexpected, see if you can explain what is happening. 
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char 


0xff 


if (c == 0xff) 

printf( n Truth!\n n ); 
el se 

printf("This needs to be explained!\n"); 
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Statements in a program are normally executed one after another. This is called sequen¬ 
tial flow of control. Often it is desirable to alter the sequential flow of control to provide 
for a choice of action, or a repetition of action. By means of i f, i f-el se, and swi tch 
tatements, a selection among alternative actions can be made. By means of whi 1 e, for, 
|fid do statements, iterative actions can be taken. Because the relational, equality, and 
Hjgical operators are heavily used in flow of control constructs, we begin with a thor¬ 
ough discussion of these operators. They are used in expressions that we think of as 
feeing true or false. We also discuss the compound statement, which is used to group 
together statements that are to be treated as a unit. 
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Relational, Equality, and Logical Operators 



he following table contains the operators that are often used to affect flow of control: 
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Relational, equality, and logical operators 


less than 

< 


greater than 

> 

Relational operators 

less than or equal to 

<= 

greater than or equal to 

>= 


Equality operators 



Logical operators 


equal to 
not equal to 


(unary) negation 
logical and 
logical or 




• WiV 
> r-~ '•? 












Si 


lifll 

llfpfj 

mmm 

l SI 

§m 

■~~ .* '> 


'-;■' Ip 

•// '.*\-prSifyi 

SK 


’Wmi 

'fmm 

gall 

g-iiiii 

HI 

fSlllb 

f|||||p 

P?S5p«$*|2 

MlL 

SMlf 

^lljllp 

'Hi 


mm i 
OH 

bwbsmi 


Chapter 4 


Flow of Control 


' 'ywf&* 

'Isa 


^0 


jjgg| 

Just as with other operators, the relational, equality, and logical operators have rules of 
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'?Y£$gm 


expressions involving these m 


■ Jl 





4.2 T Relational Operators and Expressions 
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mm,-: 


++ ( postfix) 


+ (unary) - (unary) 


* / % 


Operators 


-- (postfix) 


++ (prefix) 


(prefix) 



Associativity 


left to right 


left to right 


left to right 


left to right 


left to right 


left to right 


left to right 


= += -= *= /= etc 

right to left 

, (comma operator ) 

left to right 


The ! operator is unary. All the other relational, equality, and logical operators are 
binary. They operate on expressions and yield either the i nt value 0 or the i nt value 1. 
The reason for this is that in the C language, false is represented by the value zero and 
true is represented by any nonzero value. The value for false can be any zero value; if) 
can be a 0 or 0.0 or the null character ' \0' or the NULL pointer value. The value for true 
can be any nonzero value. Intuitively, an expression such as a < b is either true oij 
false. In C, this expression will yield the i nt value 1 if it is true and the i nt value 0 if it 



Relational Operators and Expressions 


.The relational operators 


Je all binary. They each take two expressions as operands and yield either the -in 
value 0 or the i nt value 1. 1 >1L1CI eiltlei l:,u 1 n 


relational _ expression 


expr < expr 
expr > expr 
expr <= expr 
expr >= expr 


ime examples are 


a < 3 
a > b 

-1.3 >= (2.0 * x + 3.3) 
a < b < c 

t not 


/ syntactically correct, but confusing */ 


a -< b /* out of order */ 

a < = b /* space not allowed */ 

a » b /* this is a shift expression */ 

gjjjgonsider a relational expression such as a < b. If a is less than h th™ ** „ . 

> the i nt value 1 , which we think of as heimr i,n,o if • 11 t ie ex P n 'ssion 

r pvnvot'oinn i -» ^ being true. If a, is not less than b then thp 

jfP b is hi vle'o^- T Ma,h “ y '"* value 

factors is less than that of the arithmetic operators, the^pressLT ““ 

fp Si , 

ffll a - b < 0 is equivalent to (a - b) < 0 

expression such as a < bis implemented as a - b < 0 The 
Jf 181 anthmetic conversions occur in relational expressions 

Bake of T- % toerm!™ 7 ,f thl f etlC Thc table shows how the 

|£# ue 01 el e2 determines the values of relational expressions: 


ISp 


%CL- 

• ■ 

itf.y'-.' ■ - 

wm&s&gmmSmm 


is false. 


k ■■■b 

■■ • - ■ 




if. 


y- - 3 



b < 0. The 
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Values of relational expressions 

a - b 

a < b 

a > b 

a <= b 

a >= b 

positive 

0 

1 

0 

i 

zero 

0 

0 

1 

1 

negative 

i 

0 

l 

0 


The following table illustrates the use of the rules of precedence and associativ¬ 
ity to evaluate relational expressions: 


! Declarations and initializations 


char c « ’w* 

int i = 1, j « 2, k = -7 

double x = 7e+33, y = 0,001 I 

Expression 

Equivalent expression 

Value 

1 a" 4* 1 < c 

a + 1) k c 

1 

- i - 5 * j >= k + 1 


0 

3 < j < 5 

(3 < j) < 5 

1 

x - 3.333 <- x + y 


1 

x < x + y 

x < (x + y) 

0 


Two expressions in this table give surprising results in that they do not conform 
to rules as written mathematically, hr mathematics, one writes 

3 < j < 5 

to indicate that the variable j has the property of being greater than 3 and less than 5. 
It can also be considered as a mathematical statement that, depending on the value of j, 
may or may not be true. For example, if j — 4, then the mathematical statement 

3 < j < 5 

is true, but if j = 7, then the mathematical statement is false. Now consider the C code 


3 = 7; 

printf("%d\n", 3 < j < 5); 


/* 1 gets printed, not 0 */ 
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ggpi ! 

||| analogy with mathematics, one might expect that the expression is false and that 0 is 
Nte^pnted, However, that is not the case. Because relational operators associate from left 

■ plight, 

B| 3 < j <5 is equivalent to (3 < j) < 5 


jjjpiecause the expression 3 < j is true, it has value 1. Thus, 

pHEpI 

BSi'' (3 < j) < 5 is equivalent to 1 < 5 

mmm 


has value 1. In C, the correct way to write a test for both 3 < j and j < 5 is 




■il " 3 < j && j < 5 


ggdlp^ause the relational operators have higher precedence than the binary' logical opera- 
H, this is equivalent to 


-vi 




< j) && (j < 5) 

jjj|j|j|jd, as we will see later, this expression is true if and only if both operands of the && 



jjj||Iexpression are true 


numbers that are representable in a machine do not have infinite precision 
Hlometimes, this can cause unexpected results. In mathematics the relation 


gsf 


iSSi 


marnKB x < x + y 
s#5tt«i§ > 


is equivalent to 0 < y 


^athemattcally, if y is positive, then both of these relations are logically true. Computa- 

Hp nafl y- if x is a floatin 8 var iable with a large value such as 7e+33 and y is a floating 
loanable with a small value such as 0.001, then the relational expression 


if 


■ JISox < x + y 


Hl a y be f alse < even though mathematically it is true. An equivalent expression is 




§f£l>S&V (X 

WMmxm: : 


(x 4- y)) < 0.0 


'■'< ;V - 

«|hd it is this expression that the machine implements. If, in terms of machine accuracy, 
SteaP values of x and x + y are equal, the expression will yield the i nt value 0. 
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4.3 Equality Operators and Expressions 


The equality operators == and ! = are binary operators acting on expressions. They yield 
either the int value 0 or the int value 1. The usual arithmetic conversions are applie 
to expressions that are the operands of the equality operators. 
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e next table shows how the rules of precedence and associativity are used to evaluate 
lie expressions with equality operators. 


Declarations and initializations 


mil 



'mm 


Expression 


equality-expression :: = expr == expr 
expr ::= expr 

Some examples are 

c == 1 A 1 
k ! = -2 

x + y==3*z-7 


expr != expr 







- r--- 



Equivalent expression 


3 == i 


Value 



C(i + j) + k) == (C- 2) * (- k)) 



wmi 



Observe that the expression 



“If a 1= b is equivalent to ! (a 


mm 


but not 


a = b 
a = = b 
(x + y) 


/* an assignment statement */ 

/* space not allowed */ 

/* syntax error: equivalent to (x + y) 






Mm 


f Also, note carefully that the two expressions 


m 


a 


b and a 


= 044) */ 


Intuitively, an equality expression such as a == b is either true or false• An equivaleiflj 
expression is a ~ b == 0, and this is what is implemented at the machine level. If J 
equals b, then a ~ b has value 0 and 0 == 0 is true . In this case the expression a == 
will yield the i nt value 1, which we think of as true If a is not equal to b, then a == 
will yield the i nt value 0, which we think of as false. 

The expression a ! = b uses the not equal operator. It is evaluated in a similar fasj|j 

-V ►* * 1 * * _ .ft- 1 A 4~ U X /A 1 31 



SB! visually similar. They are close in form but radically different in function. The 
oppression a == b is a test for equality, whereas a = b is an assignment expression. 
Citing 


; i f (a = 1) 


stead of 


/* do something */ 


wmm 



if (a 


« » • * » 


/* do something */ 


ids a common programming error. The expression in the first i f statement is always 
true, and an error such as this can be very difficult to find. 
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4.4 T Logical Operators and Expressions 


155 


-M 


4.4 Logical Operators and Expressions 


The logical operator ! is unary, and the logical operators && and | | are binary. All of| 
these operators, when applied to expressions, yield either the int value 0 or the intjj 

value 1. 

Logical negation can be applied to an expression of arithmetic or pointer type. If atj9 
expression has value zero, then its negation will yield the i nt value 1. If the expression 
has a nonzero value, then its negation will yield the i nt value 0. 


logicaLnegation„expression :: 
Some examples are 


expr 


!a 

! (x r- 7,7) 

! (a < b || c < d) 


m 


but not 


a! 

a != b 


/* out of order */ 

/* 1= is the token for the n not equal M operator / 




' • 'd.'MP. 

The usual arithmetic conversion rules are applied, to expressions that are the operand|j 
of !. The following table gives the semantics of the ! operator; 


Values of: 
expr / expr 


zero 

nonzero 


1 

0 


■■xt. 


1# 




Although logical negation is a very simple operator, there is one subtlety. Th|f 
operator ! in C is unlike the not operator in ordinary logic. If s is a logical statement, 
then 


not knots') 


fehereas in C the value of !! 5, for example, is 1. Because ! associates from right to left, 
the same as all other unary operators, the expression 


! ! 5 is equivalent to ! (! 5) 


jjjJjJSIj^ ! (! 5) is equivalent to ! (0), which has value 1. The following table shows how 
®^®|bine expressions with logical negation are evaluated: 




Declarations and initializations 




char 

int 

double 


c = 1 A ’ 
i = 7, j = 7; 
x = 0.0, y = 2.3 




- 




Expression 

Equivalent expression 

Value 

! c 

! c 

0 

! d - 3) 

! Ci - j) 

1 

I i - j 

i 

CM) - j 

-7 

1 I (x + y) 

! (! Cx + y)) 

i 

mm 

a x) * a a y )) 

1 




The binary logical operators && and | | also act on expressions and yield either 
the i nt value 0 or the i nt value 1. The syntax for a logical expression is given by 


togicaL expression 


mu 

. 


logical _ negation _ expression 
logical_or_ expression 
logical_and_ expression 
logical_or_expression ::= expr || expr 
logicaLancL e.xpression : := expr && expr 


examples are 






a && b 
a II b 

!(a < b) && c 
3 && (-2 * a + 7) 


not 





/* one operand missing */ 

/* extra space not allowed */ 

/* this is a bitwise operation */ 
/* the address of b */ 

























mm 

bs£- - 
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The operator semantics is given by the following table: 




Values of; 


exprl 

expr2 

zero 

zero 

zero 

nonzero 

nonzero 

zero 

nonzero 

nonzero 


exprl <&& expr2 exprl || expr2 
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^bit-circuit Evaluation 

®the evaluation of expressions that are the operands of && and | |, the evaluation pro- 
||g| stops as soon as the outcome true or false is known. This is called short-circuit 
«Sluation. It is an important property of these operators. Suppose that exprl and 
mvr2 are expressions and that exprl has value zero. In the evaluation of the logical 


The precedence of && is higher than 1 I, but both operators are of lower precedence 
than all unary, arithmetic, and relational operators. Their associativity is left to right; 
The next table shows how the rules of precedence and associativity are used to com¬ 
pute the value of some logical expressions. 


Declarations and initializations 


. 



har 

c = 

’ B ’; 

nt 

It 

1 = 

3, j = 3, k = 3; 

ouble 

X = 

0.0, y = 2.3; 


Expression 


i && j && k 


j - B 


i < j && x < y 


x || Ci && (j - 3)) _ 

(i < j) && (x < y)_ 


i < 3 


x < y 



1 == 'A' || c + 1 == 


Equivalent expression 


(i && j) && k 


(i < j) II (x < y) _ 

CA’ <= c) && (c <= ’Z’) 


((c - 1) == ’A') || ((c + 1) 


Value 




xpression 

exprl && expr2 

llie evaluation of expr2 will not occur because the value of the logical expression as a 
ole is already determined to be 0. Similarly, if exprl has nonzero value, then in the 
evaluation of 

exprl | | expr2 

e evaluation of expr2 will not occur because the value of the logical expression as a 
hole is already determined to be 1. 

Here is a simple example of how short-circuit evaluation might be used. Suppose that 
e want to process no more than three characters. 




SPIPPI 

\KA.l, ifefes 

i. 


int cnt = 0; 

while (++cnt <= 3 && (c = getcharQ) != EOF) { 
. /* do something */ 


When the expression ++cnt <= 3 is false, the next character will not be read. 


vs? 


Wmsmm 



[ 

■■■ ■■■.! 

Wmmm 


mm 

mm. 

ijjiiiwX* 

mm 


c -yvK-i, 


;= * Z ' ) 


mm 1 

mm 



The usual arithmetic conversions occur in expressions that are the operands ol logical 
operators. Note that many of the expressions in the table are of mixed type. Whenever 
this occurs, certain values are promoted to match the highest type present in the 
expression. 


msm.» j v W; j 

5 x£Ew*&r. 

ft lip 
M*Wm 

& u 




— 


m . »_ 
fei M 

WtM H 

mm 


aft'.--:-. |I1 




mm xx 
m 

«aaSi'.t'itt ‘,9m 


% d 
m ;i 

Illfll 

.in H 

: 8 l 
WMSi- fa 

wllwl 
gk\ 4 

BaggS* ‘mm 



p.5 The Compound Statement 

A compound statement is a series of declarations and statements surrounded by 
braces. 

|? compound_statement : := { { declaration } 0+ { statement } 0+ } 


I The chief use of the compound statement is to group statements into an executable 
I unit. When declarations come at the beginning of a compound statement, the com- 
5 pound statement is also called a block. (See Section 5.10, “Scope Rules,” on page 213, 
■(for a further discussion of blocks.) In C, wherever it is syntactically correct to place a 
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statement, it is also syntactically correct to place a compound statement; A compound g 
statement is itself a statement. An example of a compound statement is 


{ 


} 




a += b += c; 
printf("a = %d, b 


c = %d\n", a, b, c); 


: 8 ! 


and another example is 




{ 


} 


a 

{ 


} 


■ i; 

b = 2; 
c = 3; 


m 


This example has an inner compound statement within an outer compound statement. |j| 
Compound statements and blocks nest. Also, we follow an indentation style where 
statements within the brace are offset an additional three spaces. This lets us visually 
identify the grouping of statements within the braces. An important use of the com¬ 
pound statement is to achieve the desired flow of control in if, i f-el se, whi 1 e, for, 
do, and switch statements. 






4.6 The Expression and Empty Statement 

The expression statement is an expression followed by a semicolon. The semicolon^ 
turns the expression into a statement that must be fully evaluated before going on to 
the next program statement. 

expr_statement ::= { expr } opt ; 

The empty statement is written as a single semicolon. It is useful where a statement is 
needed syntactically, but no action is required semantically. As we shall see, this is 
sometimes useful in constructs that affect the flow of control, such as i f-el se state¬ 
ments and for statements. The empty statement is a special case of the expression 
statement. 




■swm 


li*§ 


jJjItSoHie examples of expression statements are 


5 '00. 


a ” b, 

a + b + c j 


plf % 
f -vy ■: 


printf("%d\n", a) 


/* an assignment statement */ 

/* legal, but no useful work gets done */ 
/* an empty statement */ 

/* a function call */ 




All of these examples arc expression statements. However, the assignment state- 
lalnt and function call are important enough to be given their own designations. In an 
Impression statement, the expression is evaluated, including ail side effects. Control 
hen passes to the next statement. A typical case is a = b; where a is assigned the 
ue of b. 




U . . ’T-y’i.; % , • 






4.7 The if and the if-else Statements 






The general form of an i f statement is 


m i 


P. - < T ' 
fee s*. ' 


Jgtif (expr) 

statement 


Ip3| expr is nonzero (true), then statement is executed; otherwise, statement is skipped 


Wmr - 


K4ahd control passes to the next statement. In the example 


®l!|S#. : if (grade >= 90) 


printf("Congratulations!\n"); 

8§jJprintf("Your grade is %d.\n", grade); 


; a congratulatory message is printed only when the value of grade is greater than or 
p Mjfe qual to 90. The second pri ntf () is always executed. The syntax of an if statement is 
given by 

if .statement ::= if (expr) statement 

jljpgpJsually the expression in an i f statement is a relational, equality, or logical expression, 
HjSptit as the syntax shows, an expression from any domain is permissible. Some other 
■Mixamples of i f statements are 
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if (y != 0.0) 
x /= y; 

if ( C =='•){ 

++blank_cnt; 

printf("found another blank\n"); 

} 
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mm 




mm 




y is true, then mi n will be assigned the value of x, and if it is false, then mi n will 
fljljUssigned the value of y. The syntax is given by 


■■ 




in 

1 


MKMv .-.' if^elsestatement 


if (expr) statement 
else statement 




but not 


#1 


example is 


if b == a 
area = < 


/* parentheses missing */ 


73 





iSP 


if (c >= 'a' && c <= 'z') 
» ++lc_cnt; 


else { 


Where appropriate, compound statements should be used to group a series of state 
ments under the control of a single i f expression. The following code consists of two 
i f statements: 

if (j < k) 
min = j; 

if (j < k) ; i 

printf("j is smaller than k\n"); i 

The code can be written to be more efficient and more understandable by using a single! 
i f statement with a compound statement for its body. Jf 

if (j < k) { 
min = j; 

printf("j is smaller than k\n"); fj 

} 1 


m. \ 
' 






++other_cnt; 

printf("%c is not a lowercase letter\n", c) 




f^ghut not 


THfe if ci s= j) { 




The if-else statement is closely related to the if statement. It has the general 


form 



Jmm. i += 1; 

] -f*— Z. j 

WmSs-i i I 

m§i, else 

i -= j; /* syntax error */ 

BgjJ|| 

CT The syntax error occurs because the semicolon following the right brace creates an 
j§§ empty statement, and consequently the else has nowhere to attach. 

*wj^~ Because an i f statement is itself a statement, it can be used as the statement part of 
another i f statement. Consider the code 

■Hfii 

WjBBMWt • c. /•__, \ 


if (expr) jjj 

statement 1 } 

else I 

statement2 $ 

If expr is nonzero, then statement 1 is executed and statement 2 is skipped; if exprig 
zero, then statementl is skipped and statement2 is executed. In both cases control then 
passes to the next statement. Consider the code j 

if (x < y) i 

min = x; 
el se 


wm 


air 




8■ 


H 


$ SIS 
W§i- 


Up 


■ JBliifc if (a == 1) 

W'l; if (b == 2) 

printf("***\n") 

This is of the form 

JIB® if (a == 1) 

IISRII statement 


I# 


c >.; 


ffi 


jjjgwhere statement is the following i f statement 

l; if (b == 2) 

printf("***\n"); 




W* : 

j 


: ’-'-7 ' 


I 

■ '. r 


SHI 


'T> 


gw '• 

iSSi 


' v'-- 




m 
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In a similar fashion, an if-else statement can be used as the statement part o 
another i f statement. Consider, for example, 

if (a == 1) 
if (b == 2) 

printf("***\n"); 
else 

printf("###\n"); 

Now we are faced with a semantic difficulty. This code illustrates the dangling efee 
problem. It is not clear from the syntax what the el se part is associated with. Do not 
fooled by the format of the code. As far as the machine is concerned, the following cod 
is equivalent: 

if (a == 1) 
if (b == 2) 

printf("***\n"); 

el se 

printf("###\n"); 

The rule is that an else attaches to the nearest if. Thus, the code is correctly format-.: 
ted as we first gave it. It has the form 

if (a == 1) 
statement 

where statement is the i f-el se statement 

if (b == 2) 

printf; 
el se 

printf("###\n"); 


rUn i/uhi’ 1 £* mpnf* 

9 nt! Wii I i c jIcUciU t.ill 

ffetition of action is one reason we rely on computers. When there are large amounts 
it is very convenient to have control mechanisms that repeatedly execute spe- 
fic statements. In C, the whi 1 e, for, and do statements provide for repetitive action. 

I Although we have already used the whi 1 e statement, or while loop, in many exam- 
|es, we now want to explain precisely how this iterative mechanism works. The syntax 
| given by 



while (expr) statement 


&whilestatement : 
Home examples are 

while (i++ < n ) 
factorial *= 


while ((c = getcharO) i= EOF) 
if (c >= 'a' && c <= 'z') 
++1owercase_l etter_cnt; 
++total_cnt; 

} 


{ 


but not 


/* syntax error: do is not allowed */ 


1 while (++i < LIMIT) do { 
i j = 2 * i + 3; 

IP pri ntf("%d\n", j); 

4 } 

m1|^ • 

SfiSfSfc Consider a construction of the form 

I®feiPlife; 

I while ( expr ) 

| • statement 

next statement 

First expr is evaluated. If it is nonzero (true), then statement is executed and control is 
passed back to the beginning of the while loop. The effect of this is that the body of the 
while loop, namely statement, is executed repeatedly until expr is zero (false). At that 
point, control passes to next statement. Thus, the effect of a while loop is that its body 
gets executed zero or more times. 
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It is possible to inadvertently specify an expression that never becomes zero, an| 
unless other means of escaping the whi 1 e loop are introduced, the program is stuck ip 
an infinite loop. Care should be taken to avoid this difficulty. As an example, consider 
the code 

printf("Input an integer: "); 
scanf("%d", &n); 
while (—n) 

. /* do something */ 

The intent is for a positive integer to be entered and assigned to the i nt variable n. 
Then the body of the whi 1 e loop is to be executed repeatedly until the expression —n 
is eventually zero. However, if a negative integer is inadvertently assigned to n, then the 
loop will be infinite. To guard against this possibility, it would be better to code instead 

while (—n > 0) 

. /* do something */ 

It is sometimes appropriate for a whi 1 e loop to contain only an empty statement. 
A typical example would be 

while ((c = getcharO) == ' ') 

; /* empty statement */ 

This code will cause blank characters in the input stream to be slapped. We could have 
written this as 

while ((c = getcharO) == ' '); 

However, it is considered good progr ammin g style to place the semicolon on the next 
line by itself so that it is clearly visible as an empty statement. 

Our next program illustrates the ideas that we have presented in this section. The 
program counts various kinds of characters. 
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e&iniu 


cnt_char.c 






/* count blanks, digits, letters, newlines, and others. */ 
#include <stdio.h> 


mi wmmm 







■flpStv ’ 

— i 


int main(void) 

111 int blank_cnt = 
letter_cnt 


0, c, digit_cnt = 0, 

= 0, nl_cnt = 0, other_cnt 


while ((c = getcharO) != EOF) /* braces not necessary */ 
if (c == ' ') 

++blank_cnt; 

else if (c >= '0' && c <= '9') 

++digit_cnt; 

else if (c >= 'a' && c <= 'z' || c >= 'A' && c <= 'Z') 
++letter_cnt; 
else if (c == '\n') 

++nl_cnt; 

else 

++other_cnt; 

printf("%10s%10s%10s%10s%10s%10s\n\n", 

"blanks", "digits", "letters", "lines", "others", "total"); 
printf("%10d%10d%10d%10d%10d%10d\n\n", 

blank_cnt, digit_cnt, letter_cnt, nl_cnt, other_cnt, 
blank_cnt + digit_cnt + letter_cnt + nl_cnt + other_cnt); 
return 0; 








^Kites'*- £ 




iglflll 
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To execute this program, using its source file for data, we give the command 


Ilit 


WWSB0&;'-- 

\ 

Bli 

- Ty- - 


cnt_char < cnt_chair.c 
Here is the output that appears on the screen: 


blanks 

197 


digits 

31 


letters 

348 


li nes 
27 


others 

180 


total 

783 









ms 


s: . 

p$:j 


,,;-A 7- ? ‘ 7’, 

3«l»i 


as»a 


M *&£Mi l '0zp 


a 

HeW* 
®1 
H 


l@PS 


i ' ft 'nj 
. ■ , -. ;,: • .. 

mms, ’■ mS mmm 

" "“fas - r WjSS®** 
fe-, 




H-.;V\; 


4r. 


4.9 T The for Statement 1 67 


Dissection of the cnt^chav Program 

a while ((c = getcharO) != EOF) /* braces not necessary */ 

Braces are unnecessary because the long i f-el se construct that follows is a sin 
statement that comprises the body of the while loop. This while statement repeate 
executes the compound if-else statement that follows it. Braces are unnecess 

because the whole if-else construct is a single statement. 


sM 



: fj#P 

/ 


A n 


The for Statement 




mm 

Wm-. v<0 

MgMMBpgfcaii 




fe l K for statement, like the whi 1 e statement, is used to execute code iteratively. We can 

i 1 « » **» ^ _ r-rf'i . . « 


■•V-:■■■ ■ ' ■ 

mmi 


plain its action in terms of the whi 1 e statement. The construction 


&& c <= ’9’) 


&& c <= 


i „ f 

z 


c >= ’A' && c <= ’Z') 


■ if (c == 1 ') 

++blank_cnt; 

else if (c >= '0' && c <= '9') 

++digit_cnt; 

else if (c >= 'a' && c <= 'z' || c >= 'A' && c <= 'Z'l 
++letter__cnt; 
else if (c == '\n') 

++nl_cnt; 
else 

++other_cnt; 

This statement tests a series of conditions. When a given condition evaluates to true it 
corresponding expression statement is executed and the remaining tests are the! 

for P countfn° r ea f? character read int0 the variable c, exactly one of the variables used 
ioi counting will be incremented. 



sm 




I ffor C exprl; expr2 ; expr?) 
MSS statement 
|g|§next statement 

teemantically equivalent to 




. .. 

exprl; 
HHHfe while 


mm 










while (expr2) { 
statement 
expr3 ; 

} 

next statement 



■ 

, V- ; V ‘ - 


Note that logically we could have written the i f-el se as 


if (c == • ■) 

++blank_cnt; 

else 

if (c >= ’0’ && c <= ’9') 
++digit_cnt; 
el s e, 

if (c >= 'a' && c <= 'z' 

, ++letter_cnt; 

else 






provided that expr2 is present, and provided that a conti nue statement is not in the 
ijjr I body of the for loop. From our understanding of the whi 1 e statement, we see that the 
jjllf semantics of the for statement are the following. First, exprl is evaluated. Typically, 
jp* exprl is used to initialize the loop. Then expr2 is evaluated. If it is nonzero (true), then 
Blpi statement is executed, expr3 is evaluated, and control passes back to the beginning of 
§gSft the for loop, except that evaluation of exprl is skipped. Typically, expr2 is a logical 
BBC , expression controlling the iteration. This process continues until expr2 is zero (false), at 
jjp.a which point control passes to next statement. 

The syntax of a f o r statement is given by 


c >= 'A' && c <= 'Z') 


mmsBZi 

*L:K-y ' 


H- fas® 

Mi'* 

-t 


Bfpf* mm 

mm i 

■ll V £j 




for^statement 


for (expr ; expr ; expr) statement 


!; Some examples are 


siiiisi 




I ‘mmsS 


iS f p 0 u Tiatte 1 d this way ’ each of the if-else pairs lines up to show the 

lomj rhain tC f ^ 4 :° 1 e code ' However, this coding style is not recommended because 
ong cnatns ot i f-el se statements can cause the code to march too far to the right 
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for (i =1; i <= n; ++i) 
factorial *= i; 

for (j =2; k % j == 0; ++j) { 

printf("%d is a divisor of %d\n", j, k); 
sum += j; 


n 1 iMM 

m s ^ -i 

m^JSs MH 








m -, m ■ h 
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but not 

f ° r sutri += 0 i- 1- < n ’ 1 += 3 ' ) semico ' lor,s are needed */ 

^st°rema?n * 1 ? eXpressions in a for statement can be missing, but the two 
fo r IooTt? e code “ then n ° ^^ation step is performed as 

i'=l; 
sum = 0; 

for ( ; i <= 10; ++ -j) 
sum += i ; 

computes the sum of the integers from 1 to 10, and so does the code 

i=l; 
sum = 0; 

for ( ; i <= 10 . ) 
sum += i+ +; 

loop taThfcode ^ WhCn eXpr2 iS missin S is that the test is always true. Th 

1 — 1 * 

I 

sum = 0; 
for ( ; ; ) { 

SUH1 += l-f-f; 

printf("%d\n", sum); 
is an infinite loop. 

A for statement can be used as the statement part of an i f, i f- e l se 
another fo r statement. For example, the construction 

for 

for ( ..... ) 

for ( . ) 

statement 


is a single for statement. 
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MB?.' 




m-. m 


I WBm 


wmwm 

mimm 

ms mm 


many situations, program control can be accomplished by using either a whi 1 e or 
ft® for statement. Which one gets used is often a matter of taste. One major advantage 
Bl t T^f a for loop is that control and indexing can both be kept right at the top. When loops 
|P§ f.p nested, this can facilitate the reading of the code. The program in the next section 
(ffillustrates this. 
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4.10 An Example: Boolean Variables 

Boolean algebra plays a major role in the design of computer circuits. In this algebra all 
variables have only the values zero or one. Transistors and memory technologies imple¬ 
ment zero-one value schemes with currents, voltages, and magnetic orientations. Fre¬ 
quently, the circuit designer has a function in mind and needs to check whether, for all 
possible zero-one inputs, the output has the desired behavior. 

We will use i nt variables bl, b2, ..., b5 to represent five boolean variables. They will 
be allowed to take on only the values 0 and 1. A boolean function of these variables is 
one that returns only 0 or l.A typical example of a boolean function is the majority 
function; it returns 1 if a majority of the variables have value 1, and 0 otherwise. We 
want to create a table of values for the functions 


and bl && b2 


b4 && b5 


and the majority function. Recall that logical expressions always have the i nt value 0 
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In file booLvals.c 

/ Print a table of values for some boolean functions. */ 
#include <stdio.h> 


int main(void) 

1 nt bl, b2, b3, b4, b5; 

int cnt = 0; 


/* boolean variables */ 


/* headings */ 


P "t f 5,>^s%5s%5s%5s%Ss%5s%7s%7s%lls\n\n", /* headi 

Cnt", "bl", "b 2 ", "b 3 " "b 4 " "b 5 " 

fctl", "fct 2 ", "majority")• ’ 
for (bl - 0 ; bl <: 1 ; l + bi) ’ 
for (b 2 = 0 ; b 2 <= 1 ; ++b2 ) 
for (b3 = 0 ; b3 <= 1; ++ b3) 
for (b4 = 0; b4 <= 1 ; ++b4) 
for (b5 = 0 ; b5 <= 1 ; ++b5) 

printf("%5d%5d%5d%5d%5d%5d%6d%7d%9d\n" 

++cnt, bl, b2, b3, b4, bS, 

bl I I b3 M b5, bl && b 2 || b 4 && b 5 , 

Dl + b 2 + b 3 + b 4 + hs - 
putchar("\n') ; + D:> >_ > 

return 0 ; 


The program prints a table of values for all nncciFio inmito 

It illustrates a typical use of nested for 100 ^ ^ corresponding outputs 

>P ear use 01 nested for loops. Here is some output of the program: 


Cnt 

2 


fctl 


fct2 

0 

0 


majority 
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■ ams 
- 

$ m 

aiiftai; 

■ 


I g The Comma Operator 

* comma operator has the lowest precedence of all the operators in C. It is a binary 
®tor with expressions as operands. The comma operator associates from left to 

wemsM 


-right. 


comma^expression 


expr , expr 


ggsgagp 1 -., ■ | 

HHj^^comma expression of the form 


BillR •• K -■ 

% expr 1 , expr2 

9MBM&& .W is evaluated first, and then expr2. The comma expression as a whole has the value 
and type of its right operand. An example is 

■fftwii It 

M8Hia = 0, b = 1 


■br. 

r .J|g|§§ b has been declared an i nt, then this comma expression has value 1 and type i nt. 
^^^fj lThe comma operator sometimes gets used in for statements. It allows multiple ini- 
P^^Salizations and multiple processing of indices. For example, the code 

p|§|p|s v 

■BB^r f for (sum = 0, i - 1 ; i <= n ; ++i) 


for (sum = d 
sum +~ i; 


WmP* 5 1 » 

KBS* m. 

can be used to compute the sum of the integers from 1 to n. Carrying this idea further, 
§m pve can stuff the entire body of the for loop inside the for parentheses. The previous 
Rfeti, code could be rewritten as 


for (sum =0, i =1; i <=n; sum += i, ++i) 


If 


m: 


blit not as 

iter 

mm 

iMlii -fnr ( 




for (sum = 0, i = 1; i <= n; ++i, sum += i) 


In the comma expression 


+-fi, sum += i 


the expression ++i is evaluated first, and this will cause sum to have a different value. 
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The comma operator should be used only in situations where it fits naturally. So faf|j| 
we have given examples to illustrate its use, but none of the code is natural. In SectiofSBj 
10.2, “Linear Linked Lists,” on page 449, where we discuss linked lists, we will have® 
occasion to keep track of an index and a pointer at the same time. The comma operatorlll 
can be used in a natural way to do this by writing fjjjj 

for (i =0, p = head; p != NULL; ++i , p = p -> next) ..|j|| 




gggij 


» y ' ” 

mII 


= 0 \ 

sum = 0; 

/k sum a series of integer inputs until 0 is input */ 
do { 

sum += i; 




***** 


Examples of comma expressions are given in the following table: 


Declarations and initializations 






IIS scant("%d", &i); 

IS} while (i >0); 

Consider a construction of the form 





int 

double 


i, j, k = 3 
x = 3.3; 


do 

statement 
while (expr ); 
next statement 


mwmmi 


Expression 


Equivalent expression 


Value 


statement is executed and expr is evaluated. If the value of expr is nonzero (true), 
I^^^^^Sithen control passes back to the beginning of the do statement and the process repeats 
'■Mr U itself. When expr is zero (false), then control passes to next statement. 


■ lgf t|l’ As an example, suppose that we want to read in a positive integer and that we want 
SlllSlt} insist that the integer be positive. The following code will do the job: 


i = 1, j = 2, ++k+l 
l< != 1, ++ x * 2.0 + 1 


(Ci 
(k ! 


1 ), 

1 ), 


(j = 2)), ((++ k) + 1) 
((C++ X) * 2.0) + 1) 


9.6 


■ mssp 


'mmM KBi 
IKil z 


Most commas in programs do not represent comma operators. For examplej 
those commas used to separate expressions in argument lists of functions or used 
within initializer lists are not comma operators. If a comma operator is to be used in 
these places, the comma expression in which it occurs must be enclosed in parentheses.] 






do { 

printf("Input a positive integer: "); 
scanf("%d", &n); 
if (error = (n <= 0)) 

printf("\nERR0R: Do it again!\n\n"); 
} while (error); 


■ jPpf a nonpositive integer is entered, the user will be notified with a request for a positive 
ffiMpiteger. Control will exit the loop only after a positive integer has been entered. 

lilllik" Dar>ont.Q in P nnh? n ivi:i 11 ,"»f lnnnt fn Ho /-in lnnne it ic rv-meirlfireiH 


4 ^ mmmm m W 

.12 The do Statement 






§§§£: Because in C, only a small percentage of loops tend to be do loops, it is considered 
fligood programming style to use braces even when they are not needed. The braces in 
SC the construct 


The do statement can be considered a variant of the whi 1 e statement. Instead of mak¬ 
ing its test at the top of the loop, it makes it at the bottom. Its syntax is given by 

do_statement ::= do statement while (expr) ; 

An example is 


do { 

a single statement 
} while (.); 


mm 


jjppf make it easier for the reader to realize that you have written a do statement rather than 
■HKa while statement followed by an empty statement. 

|||§p£ Now that; we have discussed in detail the i f statement, the i f-el se statement, and 
the various looping statements, we want to mention the following tip that applies to all 
By 5 of their control expressions. It is good programming style to use a relational expres- 
jj§§ : ;/ sion, when appropriate, rather than an equality expression. In many cases this will 
result in more robust code. For expressions of type float or double, an equality test 




: ■ 
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W8& 
&£5l ' 


can be beyond the accuracy of the machine. Here is an example, which on most 
machines goes into an infinite loop: , i 

In file loop.c 

/* A test that fails. */ 

#include <stdio.h> 

int main(void) 

{ 

int cnt = 0; 

double sum = 0.0, x; * 


IP* 


■ ■ IS 

im 

mm 


rm 


W^mm 


for (x = 0.0; x != 9.9; x += 0.1) { 
sum += x; 

printf("cnt = %5d\n", ++cnt); 

} 

printf("sum = %f\n", sum); 
return 0; 


/* trouble! */ 




m IS 
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We want to write a program that prints Fibonacci numbers and quotients. If f 1 con¬ 
tains the value of the current Fibonacci number and f 0 contains the value of the previ¬ 
ous Fibonacci number, then we can do the following: 


Wm*'. 

V&S&Zf: ‘ 


mmk 




1|®§S:I' 


^^>3?t ClF/S I- ' ' Vr 

»i WSmm 




gSSJ* 




' /A 

;/.w \ 


1 Save the value of f 1 (the current Fibonacci number) in a temporary. 

2 Add f0 and f 1, and store the value in f 1, the new Fibonacci number. 

3 Store the value of the temporary in f0 so that f0 will contain the previous 
Fibonacci number. 

4 Print, and then repeat this process. 

Because the Fibonacci numbers grow large very quickly, we are not able to compute 
many of them. Here is the program: 

In file fibonacci.c 

/* Print Fibonacci numbers and quotients. */ 

#include <stdio.h> 


4.13 An Example: Fibonacci Numbers 

The sequence of Fibonacci numbers is defined recursively by 


fa = 0 


f a 

i + r 


i-l 


for i = 1, 2 


5 *“ J 


Except for f 0 and f x , every element in the sequence is the sum of the previous two ele¬ 
ments. It is easy to write down the first few elements of the sequence. 

0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 233, . 

Fibonacci numbers have lots of uses and many interesting properties. One of the prop¬ 
erties has to do with the Fibonacci quotients defined by 




for 


2, 3, 


It can be shown that the sequence of quotients converges to the 
the real number (1 + sqrt 5) / 2. 


-if 


ifte&gz*}:-.; 
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golden mean, which is 
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#define LIMIT 46 

int main(void) 

{ 

long f0 = 0, fl = 1, n, temp; 

printf("%7s%19s%29s\n%7s%19s%29s\n%7s%19s%29s\n 
/* headings*/ 

" ", "Fibonacci", "Fibonacci", 


" n”, " number", " quotient", 

II __ II II___II II___II , 

printf("%7d%19d\n%7d%19d\n", 0, 0,’l, 1); /* 1st 2 cases */ 

for (n = 2; n <= LIMIT; ++n) { 
temp = fl; 
fl += f0; 
f0 = temp; 

printf("%71d%191d%29.16f\n", n, fl, (double) fl / f0); 

} 

return 0; 
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Here is some of the output of the program: 



Fibonacci 

Fibonacci 

n 

number 

quotient 

0 

0 


1 

1 


2 

1 

1.0000000000000000 

3 

2 

2.0000000000000000 

4 

3 

1.5000000000000000 

5 

5 

1.6666666666666667 

6 

8 

1.6000000000000001 

7 

* * * * * 

13 

1.6250000000000000 

V 

23 

28657 

1.6180339901755971 

9 4 

46368 

1.6180339882053250 

25 

* « * * * 

75025 

1.6180339889579021 

• 

44 

701408733 

1.6180339887498949 

45 

1134903170 

1.6180339887498949 

46 

1836311903 

1.6180339887498949 



Dissection of the fibonacci Program 


■ #define LIMIT 46 

When for loops are used, they frequently repeat an action up to some limiting value. If 

a symbolic constant such as LIMIT is used, then it serves as its own comment, and its 
value can he changed readily. 

■ long f0 =0, fi = i, n , temp; 

The variables f0, fl, and temp have been declared to be of type long so that the pro¬ 
gram will work properly on machines having 2-byte words, as well as on machines hav¬ 
ing 4-byte words. Note that some of the Fibonacci numbers printed are too large to be 

stored in a 2-byte i nt. There is no need for n to be of type 1 ong; it could just as well be 
of type i nt. 
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for (n = 2; n <= 
temp = fl; 
fl += f0; 
f0 = temp; 


LIMIT; ++n) { 


First n is initialized to 2. Then a test is made to see if n is less than or equal to LIMIT. If 
it is, then the body of the for loop is executed, n is incremented, and control is passed 
back to the top of the for loop. The test, the execution of the body, and the increment¬ 
ing of n all get done repeatedly until n has a value greater than LIMIT. Each time 
through the loop, an appropriate Fibonacci number is computed and printed. The body 
||}f the for loop is the compound statement surrounded by braces. The use of the vari¬ 
able temp in the body is essential. Suppose that instead we had written 


for (n = 2; n <= 
fl += f0; 
f0 = fl; 


LIMIT; ++n) { /* wrong code */ 


Then each time through the loop f0 would not contain the previous Fibonacci number. 

■ printf("%71d%191d%29.16f\n", n, fl, (double) fl / f0); 

Because variables of type 1 ong are being printed, the modifier 1 is being used with the 
conversion character d. Note that the field widths specified here match those in the 
printfO statements used at the beginning of the program. The field widths control 
the spacing between columns in the output. There is no magic in choosing them—just 
whatever looks good. Because a cast is a unary operator, it is of higher precedence than 
division. Thus, the expression 


(double) fl / f0 


is equivalent to 


((double) fl) / f0 


Because the operands of the division operator are of mixed type, the value of f0 gets 
promoted to a doubl e before division is performed. 
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4.14 The goto Statement 

The goto statement is considered a harmful construct in most accounts of modern pro¬ 
gramming methodology. It causes an unconditional jump to a labeled statement some¬ 
where in the current function. Thus, it can undermine all the useful structure provided 
by other flow of control mechanisms (for, whi 1 e, do, i f, swi tch). 

Because a goto jumps to a labeled statement, we need to discuss this latter construct 
first. The syntax of a labeled statement is given by 

labeled_statement ::= label : statement label : : = identifier 
Some examples of labeled statements are 

bye: exit(l); 

L444: a = b + c; 

bugl: bug2: bug3: printf("bug found\n"); /* multiple labels */ 
but not 

333: a = b + c; /* 333 is not an identifier */ 

The scope of a label is within the function in which it occurs. Label identifiers have their 
own name space. This means that the same identifier can be used both for a label and a 
variable. This practice, however, is considered bad programming style and should be 
avoided. 

Control can be unconditionally transferred to a labeled statement by executing a 
goto statement of the form 

goto label; 

An example would be 

goto error; 
error: { 

printf("An error has occurred - bye!\n"); 
exit(l); 


} 


4.15 ▼ The break and conti nue Statements 


179 



-'W-v v ri 

. . 


... 






Both the goto statement and its corresponding labeled statement must be in the body 
of the same function. Here is a more specific piece of code that uses a goto: 

while (scanf, &x) == 1) { 
if (x < 0.0) 

goto negative_alert; 

printf("%f %f\n M , sqrt(x), sqrt(2 * x)); 

} 

negative__alert: printf("Negative value encountered!\n") ; 

Note that this example could have been rewritten in a number of ways without using a 
goto. 

In general, the goto should be avoided. It is a primitive method of altering flow of 
control, which, in a richly structured language, is unnecessary. Labeled statements and 
goto’s are the hallmark of incremental patchwork program design. A programmer who 
modifies a program by adding gotos to additional code fragments soon makes the pro¬ 
gram incomprehensible. 

When should a goto be used? A simple answer is “not at all.” Indeed, one,cannot go 
wrong by following this advice. In some rare instances, however, which should be care¬ 
fully documented, a goto can make the program significantly more efficient. In other 
cases, it can simplify flow of control. This may occur when a special value is tested for 
in a deeply nested inner loop and, when this value is found, the program control needs 
to jump to the outermost level of the function. 


4.1 5 The break and continue Statements 

Two special statements, 

break; and continue; 

interrupt the normal flow of control. The break statement causes an exit from the 
innermost enclosing loop or switch statement. In the following example, a test for a 
negative argument is made, and-if the test is true , then a break statement is used to 
pass control to the statement immediately following the loop. 
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while (1) { 

scanf("%1f", &x) ; 
if (x < 0.0) 

break; /* exit loop if x is negative */ 

printf("%f\n", sqrt(x)); 

} 

/* break jumps to here */ 

This is a typical use of break. What would otherwise be an infinite loop is made to ter¬ 
minate upon a given condition tested by the i f expression. 

The conti nue statement causes the current iteration of a loop to stop and causes the 
next iteration of the loop to begin immediately. The following code processes all char¬ 
acters except digits: 

f nr r- j _ a- -j <• TOT AI * ( 

I U I v l “ XJ f l N I V/ I ML | Tt 1 / h 

c = getcharC); 
if (c >= '0' && c <= '9') 
continue; 

. /* process other characters */ 

/* continue transfers control to here to begin next iteration */ 

> 

The conti nue statement may occur only inside for, whi 1 e, and do loops. As the 
example shows, conti nue transfers control to the end of the current iteration, whereas 
break terminates the loop. With a conti nue statement, a for loop of the form 

for ( exprl ; expr2\ exprS) { 

***** 

continue; 


} 

is equivalent to 
exprl ; 

while (expr2) { 

i * « < * 

goto next; 


next: 

expr3 ; 

} 


which is different from 
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exprl ; 

while iexprZ) { 




} 


continue; 
« * 

expr3 ; 


# » « « 


||fj|j||e(See exercise 29, on page 193, for a convenient way to test this.) 






■ 


1 




mm 






Ifpiif; 


















4.16 The switch Statement 

The switch is a multiway conditional statement generalizing the if-else statement. 
Let us first look at the syntax for a swi tch statement. 


; r™ v: 

-V 


switch-statement 


i * 
m * 






nil" 


switch ( integralexpression ) 

{casestatement \ default-Statement \ switch-block } ? 
casestatement ::= { case constantJntegral_expression : } 1+ 
statement default-statement ::= default ; statement 
switch-block ::= {{declaration-list} opt {case_group}Q+ { default_group} opt } 
case-group ::= { case constant-integral-expression : } 1+ { statement } 1+ 
default-group :: = default : {statement} 1+ 














|2; T The following is a typical example of a swi tch statement: 
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switch (c) { 
case 'a': 

++a_cnt; 
break; 
case 'b': 
case 'B': 

++b_cnt; 

break; 

default: 

++other_cnt; 

} 

Notice that the body of the swi tch statement in the example is a compound statement. 
This will be so in all but the most degenerate situations. The controlling expression in 
the parentheses following the keyword switch must be of integral type. In the example, 
it is just the i nt variable c. The usual automatic conversions are performed on the con¬ 
trolling expression. After the expression is evaluated, control jumps to the appropriate 
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case label. The constant integral expressions following the case labels must all be 
unique. Typically, the last statement before the next case or defaul t label is a break 
statement. If there is no break statement, then execution “falls through” to the next 
statement in the succeeding case. Missing break statements are a frequent cause of 
error in swi tch statements. There may be at most one defaul t label in a swi tch. Typ¬ 
ically, it occurs last, although it can occur anywhere. The keywords case and default 
cannot occur outside of a swi tch. 

The effect of a switch 

1 Evaluate the swi tch expression. 

2 Go to the case label having a constant value that matches the value of the 
expression found in step 1, or, if a match is not found, go to the default label, 
or, if there is no default label, terminate the switch. 

3 Terminate the switch when a break statement is encountered, or terminate the 
swi tch by “falling off the end.” 

Let us review the various kinds of jump statements available to us. These include 
the goto, break, continue, and return statements. The goto is unrestricted in its use 
and should be avoided as a dangerous construct. The break may be used in loops and 
is important to the proper structuring of the switch statement. The continue is con¬ 
strained to use within loops and is often unnecessary. The return statement must be 
used in functions that return values. It will be discussed in the next chapter. 


4.1 7 The Conditional Operator 

The conditional operator ?: is unusual in that it is a ternary operator. It takes as oper¬ 
ands three expressions. 

conditionaLexpression :: = expr ? expr : expr 
In a construct such as 


exprl ? expr2 : expr3 

exprl is evaluated first. If it is nonzero (true), then expr2 is evaluated, and that is the 
value of the conditional expression as a whole. If exprl is zero (false), then expr3 is 
evaluated, and that is the value of the conditional expression as a whole. Thus, a condi- 
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tional expression can be used to do the work of an if-else statement. Consider, for 
example, the code 

if (y < z) 
x = y; 

else 

x = z; 

The effect of the code is to assign to x the minimum of y and z. This also can be accom¬ 
plished by writing 

| x = (y < z) ? y : z; 

Because the precedence of the conditional operator is just above assignment, parenthe¬ 
ses are not necessary. However, parentheses often are used to make clear what is being 
tested for. 

The type of the conditional expression 
exprl ? expr2 : expr3 

is determined by both expr2 and expr3. If they are of different types, then the usual 
s conversion rules are applied. Note carefully that the type of the conditional expression 
does not depend on which of the two expressions expr2 or expr3 is evaluated. The con¬ 
ditional operator ?: has precedence just above the assignment operators, and it associ¬ 
ates from right to left. 


Declarations and initializations 



char 

int 

double 

a = ’a’, b= ’b’; 
i = 1, j = 2; 
x = 7.07; 

/* a has decimal value 97 */ 



Equivalent expression 


Value 


Type 


i nt 


j % B == 0 ? i + 4 : x ((j % 3) == 0) ? (i + 4) : x 7.07 double 


double 
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Summary 

1 Relational, equality, and logical expressions have the i nt value 0 or 1. 

2 The negation operator ! is unary. A negation expression such as ! a has the i ntjj 

value 0 or 1. Remember: !! a and a need not have the same value. > 

3 A chief use of relational, equality, and logical expressions is to test data to affect | 
flow of control. 

4 Automatic type conversions can occur when two expressions are compared that are J 
the operands of a relational, equality, or logical operator. 


5 The grouping construct {.} is a compound statement. It allows enclosed g 

statements to be treated as a single unit. 






6 An i f statement provides a way of choosing whether or not to execute a statement. 

7 The el se part of an i f-el se statement associates with the nearest available ifb|| 

This resolves the “dangling else” problem. T 

8 The whi 1 e, for, and do statements provide for the iterative execution of code. The Jj 
body of a do statement executes at least once. 

9 To prevent an unwanted infinite loop, the programmer must make sure that the, < 
expression controlling the loop eventually becomes zero. The miscoding of control- | 
ling expressions is a common programming error. 

10 The programmer often has to choose between the use of a whi 1 e or a for state- jj 

ment. In situations where clarity dictates that both the control and the indexing be 
kept visible at the top of the loop, the for statement is the natural choice. | 

11 The comma operator is occasionally useful in for statements. Of all the operators 
in C, it has the lowest priority. 


12 The four statement types 

goto break continue return 

cause an unconditional transfer of flow of control. Their use should be minimized. 




» v'; v ' 

|| goto’s are considered harmful to good programming. Avoid them. 

kil4 The switch statement provides a multiway conditional branch. It is useful when 
■ dealing with a large number of special cases. 
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Exercises 






fc 1 Give equivalent logical expressions of the following without negation: 


! (a > b) 

((a <= b && c <= d) 

!(a + 1 b + 1) 

!(a < 1 || b < 2 && c < 3) 




1-5-2 Complete the following table: 


Declarations and initializations 


int a = 1, b = 2, c — 3; 

double x = 1.0; 

Expression 

Equivalent expression 

Value 

a > b && c < d 






a + b < ! c + c 



a-x || b*c&&b/a 




if 




3 Write a program that reads characters from the standard input file until EOF is 
encountered. Use the variables di gi t_cnt and other_cnt to count the number of 
digits and the number of other characters, respectively. 

v ‘ 

;| lS: 4 Write a program that counts the number of times the first three letters of the alpha- 
Igjjff bet (a, A, b, B, c, C) occur in a file. Do not distinguish between lower- and uppercase 
letters. 
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5 Write a program that contains the loop 

while (scanf("%lf", &salary) == 1) { 


wmmmm 

I 

./ :::: Wfi? 







. :«• 


Within the body of the loop, compute a 17% federal withholding tax and a 3% state 
withholding tax, and print these values along with the corresponding salary. Accu¬ 
mulate the sums of all salaries and taxes printed. Print these sums after the pro¬ 
gram exits the while loop. 

6 What gets printed? 


char 

int 


c = 'A'; 
i = 5, j = 10; 


printf("%d %d %d\n", !c, Me, !!!c); 

printf("%d %d %d\n", — ! i, ! - i, ! — i — ! j); 

printf("%d %d %d\n", ! (6 * j + i - c), I i —5, ! j 

7 Explain the effect of the following code: 


- 10 ); 





l nt i; 
while (i = 2) { 

printf("Some even numbers: %d %d %d\n", i, i +2, i +4); 
i = 0; 

} 

Contrast this code with the following: 


int i; 
if (i = 2) 

printf("Some even numbers: %d %d %d\n", i, i +2, i +4); 

Both pieces of code are logically wrong. The run-time effect of one of them is so 
striking that the error is easy to spot, whereas the other piece of wrong code has a 
subtle effect that is much harder to spot. Explain. 
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■Pf 8 What gets printed? 




char 
double 


c = 'A'; 

x = le+33, y = 0.001; 



pri ntf("%d %d %d\n", c == 'a', c == 
printf("%d\n", c == 'A' && c <= 'B' 
printf("%d\n", 1 != !!c == !!!c); 
printf("%d\n", x + y > x - y); 

9 What gets printed? Explain. 


' b ’ , c ! 
II 'C'); 


= 'c'); 


i nt 


i = 7, j = 7; 


if (i == 1) 
if (j == 2) 

printf("%d\n", i 

el se 

printf("%d\n", i = i 
printf("%d\n", i); 


= i + j); 
- j); 


10 Run a test program with this piece of code in it to find out how your compiler 
reports this syntax error. Explain why. 


while (++i < LIMIT) do { 
j = 2 * i + 3; 
printf("j = %d\n", j); 

} 


/* syntax error */ 



/* Many other languages require "do", but not C. */ 

11 Can the following code ever lead to an infinite loop? Explain. (Assume that the val¬ 
ues of i and j are not changed in the body of the loop.) 

printf("Input two integers: "); 
scanf("%d%d", &i, &j); 

while (i * j < 0 && ++i != 7 && j++ != 9) { 

. /* do something */ 

} 
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12 In Section 4.8, “The while Statement,” on page 163, we said that if n has a negative 
value, then 


while (—n) 


/* do something */ 


is an infinite loop. Actually, this is system dependent. Can you explain why? 

13 Write a program that reads in an integer value for n and then sums the integers 
from n to 2 * n if n is nonnegative, or from 2 * n to n if n is negative. Write the 
code in two versions, one using only for loops and the other using only while 
loops. 

14 The function putcharO returns the int value of the character that it writes. What 
does the following code print? 

for (putchar('l'); putcharf'2'); putchar('3')) 
putcharC'4'); 

15 For the C language, answer the following true-false questions: 

Every statement ends in a semicolon. 

Every statement contains at least one semicolon. 

Every statement contains at most one semicolon. 

There exists a statement with precisely 33 semicolons. 

There exists a statement made up of 35 characters that contains 33 semicolons. 

16 In Section 4.10, “An Example: Boolean Variables,” on page 170, we presented a pro¬ 
gram that prints a table of values for some boolean functions. Execute the program 
and examine its output. For the 32 different inputs, exactly half of them (16 in num¬ 
ber) have majority value 1. Write a program that prints a table of values for the 
majority function for, say, 7 boolean variables. Of the 128 different inputs, how 
many have majority value 1? State the general case as a theorem and try to give a 
proof. (Your machine can help you find theorems by chocking special cases, but in 
general it cannot give a proof.) 

17 Write three versions of a program that computes the sum of the first n even inte¬ 
gers and the sum of the first n odd integers. The value for n should be entered 
interactively. In the first version of your program, use the code 

for (cnt =0, i =1, j =2; ent < n; ++cnt, i += 2, j += 2) 
odd_sum += i, even_sum += j; 






Note the prolific use of the comma operator. The code is not very good, but it will 
give you confidence that the comma operator works as advertised. In the second 
version, use one or more for statements but no comma operators. In the third ver¬ 
sion, use only whi 1 e statements. 

Choose one version of the program that you wrote in exercise 16, on page 188, and 
incorporate into it the following piece of code: 

do { 

printf("Input a positive integer: "); 
scanf("%d", &n); 
if (error = (n <= 0)) 

printf("\nERR0R: Do it again!\n\n"); 

} while (error); 

Then write another version of the program that uses a whi 1 e statement instead of a 
do statement to accomplish the same effect. 

Until interrupted, the following code prints TRUE FOREVER on the screen repeat¬ 
edly. (In UNIX, type a control-c to effect an interrupt.) 

while (1) 

printf(" TRUE FOREVER "); 

Write a simple program that accomplishes the same thing, but use a for statement 
instead of a whi le statement. The body of the for statement should contain just 
the empty statement “; ”. 


20 The following code is meant to give you practice with short-circuit evaluation: 


i nt 


a = 0, b = 0, x; 




x = 0 && (a = b = 777); 
printf("%d %d %d\n", a, b, x); 
x = 777 (| (a = ++b); 
printf("%d %d %d\n", a, b, x); 

What gets printed? First, write down your answers. Then write a test program to 
check them. 
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if C(x != 0.0) && (Cz - x) / x * x < 2.0)) 

(b) if (C(z - x) / x * x < 2.0) && (x != 0.0)) 

22 Suppose that we have three statements called stl, st2, and st3. We wish to write an 
i f-el se statement that will test the value of an i nt variable i and execute differ 
ent combinations of the statements accordingly. The combinations are given in the 
following table: 


execute 

* 

l 

execute 

i 

execute 

Stl 

1 

st2 

i 

stl, st2 

st2 

2 

stl, st3 

2 

Q t 1 Of / 
i)ll | u L.L 

st3 

3 

Stl 

3 

st2, st3 


Write programs that read in values for i interactively. Use appropriate printfQ 
statements to check that the flow of control mimics the action described in the 
tables. For example, statements such as 

if (i == i) 

printf("statement_l executed\n"); 
can be used to show that your programs run properly. 

23 A polynomial in x of at most degree 2 is given by 

ax 1 + bx + c 

Its discriminant is defined to be 
b 2 - 4ac 

We are interested in the square root of the discriminant. If the discriminant is non- 
negative, then 

Jb 4 ac 

has its usual interpretation, but if the discriminant is negative, then 


means 


id-tb* + 4 ac' 


21 The semantics of logical expressions imply that order of evaluation is critical 
some computations. Which of the following two alternate expressions is most 
to be the correct one? Explain. 
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where i = 7-1 or equivalently, i 2 = - 1. 

Write a program that reads in values for a, b, and c and prints the value of the 
square root of the discriminant. For example, if the values 1,2, and 3 are read in, 
then i *2.828427 should be printed. 

Write a program that repeatedly reads in values for a, b, and c and finds the roots 
of the polynomial 

ax 2 + bx+ c 

Recall that the roots are real or complex numbers that solve the equation 
ax 2 + bx + c = 0 

When both a = 0 and b = 0, we consider the case “extremely degenerate” and leave 
it at that. When a = 0 and b != 0, we consider the case “degenerate.” In this case 
the equation reduces to 

bx + c = 0 

and has one root given by x = - c/b. 

When a* 0 (the general case), the roots are given by 

_ 1 _ 

2 a 


The expression under the square root sign is the discriminant. If the discriminant is 
positive, then two real roots exist. If the discriminant is zero, then the two roots are 
real and equal. In this case we say that the polynomial (or the associated equation) 
has multiple real roots. If the discriminant is negative, then the roots are complex. 
For each set of values for a, b, and c, your program should print the computed 
root(s) along with one of the following messages: 

degenerate two real roots extremely 
degenerate two complex roots multiple real roots 



root^ = 2 ~(~ b + 7 b - 4ac) 


root, 
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For example, if the values 1, 2, and 3 are read in for a, b, and c, respectively, then 
the following should be printed: 

two complex roots: rootl = -1.000000 + i*1.414214 

root2 = -1.000000 - 1*1.414214 

25 A truth table for a boolean function is a table consisting of all possible values for its 
variables and the corresponding values of the boolean function itself. In Section 
4.10, “An Example: Boolean Variables,” on page 170, we created a truth table for the 
majority function and two other functions. In that table we used 1 and 0 to repre¬ 
sent true and false , respectively. Create separate truth tables for the following bool¬ 
ean functions: 

(a) bl || b2 || b3 || b4 

(b) bl && b2 && b3 && b4 

(c) I(!bl || b2) && (!b3 || b4) 

Use the letters T and F in your truth tables to represent true and false , respectively. 
Hint: Use the #def i ne mechanism to define a B00LEX, and write your program to 
operate on an arbitrary BOOL EX. 

26 Write a program to check the proper pairing of braces. Your program should have 

two variables: one to keep track of left braces, say left_cnt, and the other to keep 

track of right braces, say right_cnt. Both variables should be initialized to zero. 

Your program should read and print each character in the input file. The appropri¬ 
ate variable should be incremented each time a brace is encountered. If ri ght_cnt 
ever exceeds the value of 1 ef t_cnt, your program should insert the character pair 
?? at that point in the output. After all the characters in the input file have been 
processed, the two variables 1 eft_cnt and ri ght_.cnt should have the same value. 
If not, and 1 eft_cnt is larger than ri ght_cnt, then a message should be printed 
that includes the number of right braces missing as a series of that many }’s. For 
example, 

ERROR: Missing right braces: }}} 

Use the macros getchar() and putcharO for input/output. Test your program by 
processing some files containing your own C code. 
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27 Extend the program that you wrote in the previous exercise so that it deals with 
both braces and parentheses simultaneously. 

28 In Section 4.8, “The whi 1 e Statement,” on page 165, we presented a program that 
counts blanks, digits, and letters. Modify the program so that lower- and uppercase 
letters are counted separately. 

29 Rewrite the following two pieces of code to avoid using break or conti nue: 


while (c = getcharO) { 
if ( C == ’E') 
break; 

n+" ■ 

if (c’>= ’0' && c <= ’9’) 
++digit_cnt; 

} 


i = -5; 
n = 50; 

while (i < n) { 

++i; 

if (i == 0) 
continue; 
total += i; 

printf("i = %d and total 

} 


%d\n", i, total); 


30 Show how a whi 1 e statement can be rewritten as a goto statement and an i f state¬ 
ment. Which is the better construct, and why? 
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31 Because it leads to “spaghetti code," the goto is seldom used in today’s program¬ 
ming world. The following code illustrates how just a few goto statements can 
make the flow of control incoherent: 

d = b * b — 4.0 * a * c * 
if (d == 0.0) 
goto LI; 

else if (d > 0.0) { 
if (a != 0.0) { 

rl = Ob + sqrt(d)) / (2.0 * a); 
r2 = Ob - sqrt(d)) / (2.0 * a); 
goto L4; 

} 

el se 

goto L3; 

} 

else 

goto L2; 

LI: 

if (a != 0.0) 

rl = r2 = -b / (2.0 * a); 
el se 

goto L3; 
goto L4; 

L2: 

if (a != 0.0) { 

printf("imaginary roots\n"); 
goto L4; 

} 

L3: printf("degenerate case\n"); 

L4: . 

Note how the programmer kept adding different cases and had to repatch the code 
until the program logic became obscure. Rewrite this code without using goto 
statements. 

32 Here is a simple way to test the effect of a conti nue statement in the body of a for 
loop. What gets printed? 

for (putchar(' V ); putchar(’2'); putchar('3')) { 
putchar('4'); 
continue; 
putchar('5'); 


} 
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The mathematical operation min(x, y) can be represented by the conditional 
expression 

(x < y) ? x : y 

In a similar fashion, using only conditional expressions, describe the mathematical 
operations 


min(x, y, z) 


and 


max(x, y, z, w) 


Which of the following two statements is legal? Study the syntax for a switch state¬ 
ment to answer this, then write a test program to see if your compiler complains. 


switch (1); 

switch (1) switch (1); 


/*version 1 */ 
/* version 2 */ 


In ANSI C, labels have their own name space, whereas in traditional C, they do not. 
This means that in ANSI C, the same identifier can be used for a variable and a 
label, although it is not considered good programming practice to do so. Write a 
test program containing the following code and execute it on your ANSI C compiler: 


int L = -3; 
if (L < 0) 
goto L; 

printf("L = %d\n", L) ; 

L: printf("Exiting label test!\n") 


/* Lisa variable */ 


/* L is a label */ 


If a traditional C compiler is available to you, check to see that your program will 
not compile. 

(Advanced) Let a be a positive real number, and let the sequence of real numbers 
x i be given by 


Xn = 1 . 


X • , 1 

l + 1 


1, a, 

2 ix i + F } 

i 


for 7=0, 1, 2, 


It can be shown mathematically that 


x. -> Ja 

l 


^ (X> 


This algorithm is derived from the Newton-Raphson method in numerical analysis. 
Write a program that reads in the value of a interactively and uses this algorithm to 
compute the square root of a. As you will see, the algorithm is very efficient. (None¬ 
theless, it is not the algorithm used by the sqrtQ function in the standard library.) 
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Declare x0 and xl to be of type double, and initialize xl to 1. Inside a loop do the 
following: 

= xl; /* save the current value of xl */ 

xl = 0.5 * (xl + a / xl); /* compute a new value of xl */ 

The body of the loop should be executed as long as x0 is not equal to xl. Each time 
through the loop, print out the iteration count and the values of xl (converging to 
the square root of a) and a - xl * xl (a check on accuracy). 

37 (Advanced) Modify the program you wrote in the previous exercise, so that the 

square roots of 1, 2.n are all computed, where n is a value that is entered inter 

actively. Print the number, the square root of the number, and the number of itera 
tions needed to compute it. (You can look in a numerical analysis text to discover 
why the Newton-Raphson algorithm is so efficient. It is said to be “quadratically 
convergent.”) 

38 (Advanced) The constant e, which is the base of the natural logarithms, is given to 
41 significant figures by 

e = 2.71828 18284 59045 23536 02874 71352 66249 77572 


Define 

r 1 \n 

x = I 1 + - for n= 1, 2,... 
n \ n) 

It can be shown mathematically that 


% n e &S H ~> ©o 


Investigate how to calculate e to arbitrary precision using this algorithm. You will 
find that the algorithm is computationally ineffective. (See exercise 36, on page 
195.) 


39 (Advanced) In addition to the algorithm given in the previous exercise, the value 
for e is also given by the infinite scries 

. 1111 
e ~ l + — h- 1 — j-l 

1! 2! 3! 4! 

The above algorithm is computationally effective. Use it to compute e to an arbi¬ 
trary precision. 
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The heart of effective problem solving is problem decomposition. Taking a problem 
and breaking it into small, manageable pieces is critical to writing large programs. In C, 
the function construct is used to implement this “top-down” method of programming. 

A program consists of one or more files, with each file containing zero or more func¬ 
tions, one of them being a mainQ function. Functions are defined as individual objects 
that cannot be nested. Program execution begins with main(), which can call other 
functions, including library functions such as pri ntf () and sqrt(). Functions operate 
with program variables, and which of these variables is available at a particular place in 
a function is determined by scope rules. In this chapter we discuss function definition, 
function declaration, scope rules, storage classes, and recursion. 


5.1 Function Definition 

The C code that describes what a function does is called the function definition. It must 
not be confused with the function declaration. A function definition has the following 
general form: 

type function_name( parameter list ) { declarations statements } 

Everything before the first brace comprises the header of the function definition, and 
everything between the braces comprises the body of the function definition. The 
parameter list is a comma-separated list of declarations. An example of a function defi¬ 
nition is 
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int factorial(int n) /* header */ 

{ /* body starts here */ 

int i, product = 1; 
for (i = 2; i <= n; ++i) 
product *= i; 
return product; 

} | 

The first i nt tells the compiler that the value returned by the function will be con¬ 
verted, if necessary, to an i nt. The parameter list consists of the declaration i nt n. 
This tells the compiler that the function takes a single argument of type i nt. An 
expression such as factorial (7) causes the function to be invoked, or called. The 
effect is to execute the code that comprises the function definition, with n having the 
value 7. Thus, functions act as useful abbreviating schemes. Here is another example of 
a function definition: 


void wrt_address(void) 

{ 

printf("%s\n%s\n%s\n%s\n%s\n\n", 

’' * * * * * * * it * * * * it * it * it Vc * * 11 

" ** SANTA CLAUS 

" ** NORTH POLE 

" ** EARTH 

1 1 * * * ft ft ft ft ft it it it it it it it it it it it it M ^ » 


The first void tells the compiler that this function returns no value; the second void 
tells the compiler that this function takes no arguments. The expression 

wrt_addressO 

causes the function to be invoked. For example, to call the function three times we can 
write 

for (i =0; i <3; ++i) 
wrt_address(); 

A function definition starts with the type of the function. If no value is returned, then 
the type is voi d. If the type is something other than voi d, then the value returned by 
the function will be converted, if necessary, to this type. The name of the function is 
followed by a parenthesized list of parameter declarations. The parameters act as 
placeholders for values that are passed when the function is invoked. Sometimes, to 
emphasize their role as placeholders, these parameters are called the formal parame¬ 
ters of the function. The function body is a block, or compound statement, and it too 
may contain declarations. Some examples of function definitions are 
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void nothing(void) { } 

double twice(double x) 

{ 

return (2.0 * x); 

} 

int all_add(int a, int b) 

{ 

int c; 

v I a ( > 

return (a + b + c); 


/* this function does nothing */ 


■ J§T if a function definition does not specify the function type, then it is i nt by default. For 
1 example, the last function definition could be given by 




all_add(int a, int b) 

{ 


However, it is considered good programming practice to specify the function type 
explicitly. (See exercise 5, on page 236, for further discussion.) 

Any variables declared in the body of a function are said to be “local” to that func¬ 
tion. Other variables may be declared external to the function. These are called “global” 
variables. An example is 




#include <stdio.h> 


i nt 


a = 33; 


/* a is external and initialized to 33 */ 


int main(void) 

{ 

int b = 77 

printf("a = ! 
printf("b = ! 
return 0; 


%d\n", a); 
%d\n", b); 


/* bis local to main() */ 
/* a is global to main() */ 


In traditional C, the function definition has a different syntax. The declarations of the 
variables in the parameter list occur after the parameter list itself and just before the 
first brace. An example is 
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The parameter type list is typically a comma-separated list of types. Identifiers are! 
optional; they do not affect the prototype. For example, the function prototype f 

void f(char c, int i); is equivalent to void f(char, int); f 

The identifiers such as c and i that occur in parameter type lists in function prototypes 
are not used by the compiler. Their purpose is to provide documentation to the pro¬ 
grammer and other readers of the code. The keyword void is used if a function takes 
no arguments. Also, the keyword voi d is used if no value is returned by the function. If 
a function takes a variable number of arguments, then the ellipses (...) are used. See, 
for example, the function prototype for pri ntf () in the standard header file stdio.h. 

Function prototypes allow the compiler to check the code more thoroughly. Also, val¬ 
ues passed to functions are properly coerced, if possible. For example, if the function 
prototype for sqrt() has been specified, then the function call sqrt(4) will yield the 
correct value. Because the compiler knows that sqrt() takes a doubl e, the i nt value 4 
will be promoted to a doubl e and the correct value will be returned. (See exercise 5, on 
page 236, for further discussion.) 

In traditional C, parameter type lists are not allowed in function declarations. For 
example, the function declaration of sqrt () is given by 

double sqrt(); /* traditional C style */ 

Even though ANSI C compilers will accept this style, function prototypes are preferred. 
With this declaration, the function call sqrt(4) will not yield the correct value. (See 
exercise 5, on page 236.) 

Function Prototypes in C++ 

In C++, function prototypes are required, and the use of voi d in the parameter type list 
in both function prototypes and function definitions is optional. Thus, for example, in 

^ ' V ' l w » | ■ 

void f() is equivalent to void f(void) 

Note carefully that this results in a conflict with traditional C. In traditional C, a func¬ 
tion declaration such as 

int f(); 


means that f () takes an unknown number of arguments. In traditional C, void is not a 
keyword. Thus, it cannot be used in a parameter list in a function declaration or func¬ 
tion definition. 


wff-m 
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C- 5.4 


An Example: Creating a Table of Powers 


mm this section, we give an example of a program that is written using a number of tunc- 
sJpSfoS For simplicity, we will write all the functions one after another m one file. The 

®1Pfurpose of the program is to print a table of powers. 


#include <stdio.h> 
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#define 

long I 
void I 
voi d I 


power(int, int); 
prn_heading(void);_ 
prn_tbl_of_powers(i nt); 
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int main(void) 

£ 

prn_heading(); 

p rn_tbl_of_powe rs(N); 
return 0; 

} 

void prn_heading(void) 

{ printf("\n::::: A TABLE OF POWERS : 

} 

void prn_tbl_of_powers(int n) 

{ 

int i,j! 

for (i = 1; i <= n; ++i) { 
for (j =1; j <= n; ++j) 

-jf (j == 1) 

printf("%1d", power(i, j)); 
else 

printf("%91d", power(i, 3 )) 
putchar('\n')! 


::::\n\n") 





} 
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long power(int m, int n) 

{ 

i nt i; 

long product = 1; 

for (i = 1; i <= n; ++ -j) 
product *= m; 
return product: 


Here is the output of the program; 


* * • w m 

* * * ■ * 


A TABLE OF POWERS 


* * * * 


x i a « , 


1 

32 

243 


1 

64 

729 


1 

128 

2187 


Note that the first column consists of integers raised to the first power, the second col¬ 
umn consists of integers raised to the second power, and so forth, in our program we 
have put the function prototypes near the top of the file. This makes them visible 
throughout the rest of the file. We used the type long so that the program will produce 
the same output whether the machine has 2- or 4-byte words. Note that the function 
power () computes the quantity m n , which is in raised to the nth power. 

Oui program illustrates in a very simple way the idea of top-down design. The pro¬ 
grammer thinks of the tasks to be performed and codes each task as a function. If a 
particular task is complicated, then that task, in turn, can be subdivided into other 
tas cs, each coded as a function. An additional benefit of this is that the program as a 
whole becomes more readable and self-documenting. 


5.5 


Function Declarations from the Compiler’s 
Viewpoint 


To the compiler , function declarations are generated in various ways: by 7 function invo¬ 
cation by function definition, and by explicit function declarations and function proto- 
ypes. II a function call, say f (x), is encountered before any declaration, definition, or 
prototype for it occurs, then the compiler assumes a default declaration of the form 
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gg^int f 0; 

Nothing is assumed about the parameter list for the function. Now suppose that the fol¬ 
lowing function definition occurs first: 


int f(x) 
double x; 

{ 


/* traditional C style */ 


« «»*<* 


This provides both declaration and definition to the compiler. Again, however, nothing 
is assumed about the parameter list. It is the programmer’s responsibility to pass only 
a single argument of type double. A function call such as f (1) can be expected to fail 
because 1 is of type i nt, not doubl e. Now suppose that we use, instead, an ANSI C style 
definition: 


int f(double x) 


/* ANSI C style */ 


* % M- • * 


The compiler now knows about the parameter list as well. In this case, a function call 
such as f (1) can be expected to work properly. When an int gets passed as an argu¬ 
ment, it will be converted to a double. 

A function prototype is a special case of a function declaration. A good programming 
style is to give either the function definition (ANSI C style) or the function prototype or 
both before a function is used. A major reason for including standard header files is 
because they contain function prototypes. 


Limitations 


Function definitions and prototypes have certain limitations. The function storage class 
specifier, if present, can be either extern or stati c, but not both; auto and regi ster 
cannot be used. (See Section 5.11, “Storage Classes,” on page 216.) The types “array of 
...” and “function returning ...” cannot be returned by a function. However, a pointer 
representing an array or a function can be returned. (See Section 6.6, “Arrays as Func¬ 
tion Arguments,” on page 256.) The only storage class specifier that can occur in the 
parameter type list is regi ster. Parameters cannot be initialized. 
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5.6 An Alternate Style for Function Definition Order 

Suppose we want to write a program in a single file. If our program contains more than 
one function definition, we usually put #i ncl udes and #defi nes at the top of the file, 
other program elements such as templates of enumeration types (Section 7.5, “Enumer¬ 
ation Types," on page 345) and templates of structures and unions (Section 9.4, “Using 
Structures with Functions,” on page 416) next, then a list of function prototypes, and 
finally the function definitions, starting with mai n(). Because function definitions also 
serve as function prototypes, an alternate style is to remove the list of function proto¬ 
types and to put the function definition of any function that gets called before the func¬ 
tion definition of its caller. In particular, mai n () goes last in the file. 

Let us illustrate this alternate style by modifying the tbLofipowers program that we 
wrote in Section 5.4, “An Example: Creating a Table of Powers,” on page 203. Here is 
another way of writing that program: 

#include <stdio.h> 

fdefine N 7 

void prn_heading(void) 

{ 

t t * s « 

} 

long power(int m, int n) 

{ 

w * « » » 

} 

void prn__tbl_of_powers(int n) 

{ 

« « « « E 

printf("%ld", power(i, j)); 

* * < * * 

} 

int main(void) 

{ 


prn_heading(); 
prn_tbl_of_powers(N); 
return 0; 
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Because power() is called by prn_tbl_of_powers(), the function definition for 
power () must come first. Similarly, the function definitions of the two functions called 
by inainQ must come before the function definition of main(). (See exercise 10, on 
page 238, for further discussion.) 

Although we favor the top-down style that puts mai n() first, we will occasionally use 
this alternate style as well. 


5.7 Function Invocation and Call-by-Value 

A program is made up of one or more function definitions, with one of these being 
main(). Program execution always begins with main(). When program control encoun¬ 
ters a function name, the function is called, or invoked. This means that program con¬ 
trol passes to that function. After the function does its work, program control is passed 
back to the calling environment, which then continues with its work. 

Functions are invoked by writing their name and an appropriate list of arguments 
within parentheses. Typically, these arguments match in number and type (or compati¬ 
ble type) the parameters in the parameter list in the function definition. The compiler 
enforces type compatibility when function prototypes are used. All arguments are 
passed “call-by-value.” This means that each argument is evaluated, and its value is 
used locally in place of the corresponding formal parameter. Thus, if a variable is 
passed to a function, the stored value of that variable in the calling environment will 
not be changed. Here is an example that clearly illustrates the concept of “call-by¬ 
value”: 

#include <stdio.h> 

int compute_sum(int n); 

int main(void) 

{ 

int n = 3, sum; 

printf("%d\n", n); /* 3 

sum = compute„sum(n); 
printf("%d\n", n); /* 3 

printf("%d\n", sum); /* 6 

return 0; 

} 


1 s printed */ 

is printed */ 
is printed */ 
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int compute_sum(int n) /* sum the integers from 1 to n ’'/ 

{ 

int sum = 0; 

for ( ; n > 0; —n) /* stored value of n is changed */ 

sum += n; 
return sum; 

} 

Even though n is passed to compute_sum() and the value of n in the body ol that func¬ 
tion is changed, the value of n in the calling environment remains unchanged. It is the 

value of n that is being passed, not n itself. 

The “call-by-value” mechanism is in contrast to that of “call-by-reference. In Section 
6.3, “Call-by-Reference,” on page 252, we will explain how to accomplish the effect of 
“call-by-reference.” This is a way of passing addresses (references) of variables to a 
function that then allows the body of the function to make changes to the values of 
variables in the calling environment. 

Function invocation means: 

1 Each expression in the argument list is evaluated. 

2 The value of the expression is converted, if necessary, to the type of the formal 
parameter, and that value is assigned to its corresponding formal parameter at 
the beginning of the body of the function. 

3 The body of the function is executed. 

4 If a return statement is executed, then control is passed back to the calling envi¬ 
ronment. 

5 If the return statement includes an expression, then the value of the expression 
is converted, if necessary, to the type given by the type specifier of the function, 
and that value is passed back to the calling environment, too. 

6 If the return statement does not include an expression, then no useful value is 
returned to the calling environment. 

7 If no return statement is present, then control is passed back to the calling envi 
ronment when the end of the body of the function is reached. No useful value is 

returned. 

8 All arguments are passed “call-by-value.” 
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5.8 Developing a Large Program 

Typically, a large program is written in a separate directory as a collection of .h and .c 
files, with each ,c file containing one or more function definitions. Each .c file can be 
recompiled as needed, saving time for both the programmer and the machine. (See Sec¬ 
tion 11.17, “The Use of make,” on page 532 for further discussion.) 

Let us suppose we are developing a large program called pgm. At the top of each of 
our x files we put the line 






IS 

i?; 


#include "pgm.h" 

When the preprocessor encounters this directive, it looks first in the current directory 
for the file pgm.h. If there is such a file, then it gets included. If not, then the preproces¬ 
sor looks in other system-dependent places for the file. If the file pgm.h cannot be 
found, then the preprocessor issues an error message and compilation stops. 

Our header file pgm.h may contain #includes, #defines, templates of enumeration 
types, templates of structure and union types, other programming constructs, and 
finally a list of function prototypes at the bottom. Thus, pgm.h contains program ele¬ 
ments that are appropriate for our program as a whole. Because the header file pgm.h 
occurs at the top of each .c file, it acts as the “glue” that binds our program together. 


in 


./V 


m 


m 


Create a .h file that is included in all the .c files 

pgm.h 


mam.c 


#include "pgm.h 11 


#includes 

#defines 


list of fct prototypes 


fete 


#inc1ude "pgm.h" 


prn.c 


#1nc1ude "pgm.h" 




* » * 9 * 


■ • « « » 





















210 Chapters ▼ Functions 


Let us show a very simple example of how this works. We will write our program in a 
separate directory. It will consist of a .h file and three .c files. Typically, the name of the 
directory and the name of the program are the same. Here is our program: 

In file pgm.h 

#include <stdio.h> 

#include <stdlib.h> 

#define N 3 

void fctlfint k); 
void fct2(void); 
void wrt_info(char *); 

In file main.c 

#include "pgm.h" 

int main(void) 

{. 

char ans; 
int i, n = N; 

printf ("%s", 

"This program does not do very much.\n" 

"Do you want more information? "); 
scanf(" %c", &ans); 
if (ans == 'y' || ans == 'Y') 
wrt_info("pgm"); 
for (i =0; i < n; ++i) 

.C "i / 4 ‘A » 

t Mil *m 9 J 

printf("Bye!\n"); 
return 0; 

} 

In file fct.c 

#include "pgm.h" 

void fctl(int n) 

{ 

* * * 
int i; 

printf("Hello from fctl()\n"); 
for (i =0; i < n; ++i) 
fct2(); 


} 
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void fct2(void) 

{ 

printf("HeT1o from fct2()\n"); 

} 


J In file wrt.c 


#include "pgm.h" 

void wrt_info(char *pgm_name) { 

printf("Usage: %s\n\n", pgm_name); 
printf("%s\n", 

"This program illustrates how one can write a program\n" 

"in more than one file. In this example, we have a\n" 
"single .h file that gets included at the top of our\n" 
"three .c files. Thus, the .h file acts as the \"glue\"\n" 
"that binds the program together.\n" 

"\n" 

"Note that the functions fctl() and fct2() when called\n" 
"only say \"hello.\" When writing a serious program, the\n 
"programmer sometimes does this in a first workingXn" 
"version of the code.\n"); } 


b Note that we used the type char * (pointer to char). (We will discuss pointers in Sec¬ 
tion 6.2, “Pointers,” on page 248.) We compile the program with the command 

cc -o pgm main.c fct.c prn.c 


| The compiler makes the executable file pgm along with three .o files that correspond to 
.c files. (In MS-DOS, they are .obj files.) The .o files are called object files. (For further 
| discussion about these object files and how the compiler can use them, see Section 
f 11.13, “The C Compiler,” on page 522.) 

X For a more interesting example of a program written in separate files, see Section 7.6, 
“An Example: The Game of Paper, Rock, Scissors,” on page 348. Other examples are 
| available via ftp and the Internet. If you are connected to the Internet, try the following 
command: 

ftp bc.aw.com 

After you have made a connection, you can change directory (cd) to be, and then cd to 
kelley_pohl, and then look around, A large program is much easier to investigate if you 
have the source code. Then you can print out whatever is of interest and, with the help 
| of a debugger, you can step through the program if you wish to do so. (See Section 
§11.19, “Other Useful Tools,” on page 539.) 


pMptfifill 

Bp-sip 

fa 
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What Constitutes a Large Program? 

For an individual, a large program might consist of just a few hundred lines of code. 
Which lines get counted? Usually all the lines in any READ_ME files (there should be at 
least one), the .h files, the ,e files, and the makefile. (See Section 11.17, “The Use of 
make,” on page 532.) In UNIX, the word count utility wc can be used to do this: 

wc READ_ME *.h *.c makefile 

In industry, programs are typically written by teams of programmers, and a large pro¬ 
gram might be considered anything over a hundred thousand lines. 

The style of writing a program in its own directory as a collection of .h and .c files 
works well for any serious program, whether it is large or small, and all experienced 
programmers follow this style. To become proficient in the style, the programmer has 
to learn how to use make or some similar tool. (See Section 11.17, “The Use of make,” 
on page 532.) 


5.9 Using Assertions 


The C system supplies the standard header file assert.h, which the programmer can 
include to obtain access to the assertO macro. (See Section 8.11, “The assertf) 
Macro,” on page 388, for further discussion.) Here, we want to show how the program¬ 
mer can use assertions to facilitate the programming process. Consider the following 
code: 

#include <assert.h> 

#include <stdio.h> 

int f(int a, int b); 

int g(int c); 
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int main(void) 

{ 

int a, b, c; 


scanf("%d%d", &a, &b); 


c = f(a, b); 
assert(c > 0); 


/* an assertion */ 


* « * » « 


§§»§§? jiii 

'*/. ■ -' \ ' 2 , U2V.’' 






mm 


m 


md If tbe ex P ress i° n passed as an argument to assertO is false, the system will print a 
§lt|j|| messa S e > an d the program will be aborted. In this example, we are supposing that f O 
Ijjjlj embodies an algorithm that is supposed to generate a positive integer, which gets 
Hi returned. Perhaps someone else wrote the code for the function definition of f () and 

HP SU K that the Value returned by f ° is positive. In the function’defi- 

4ffci 0^ or ^ Jt ma ^ k e im portant that the arguments a and b satisfy certain condi- 
HK tlons ' L * us su PP°se that we expect a to have either 1 or -1 as its value and that wc 
Hte- expect b t0 lie m the int «rval [7,11], We can use assertions to enforce these conditions. 


v-t u-- 




int f(int a, int b) 

{ 


■m. 


?-V X 


m 


kH 




y. -■■ 




assert(a == 1 || a == -1); 
assertfb >= 7 && b <= 11); 

y 7 


HfS 


Assertions are easy to write, add robustness to the code, and help other readers of the 
methodology tand ** ^ ^ ° f aSsertions is cohered good programming 



5.10 Scope Rules 

The basic rule of scoping is that identifiers are accessible only within the block in which 
they are declared. They are unknown outside the boundaries of that block. This would 

use e d t0 f°f l0W ’ Aat Pn,gramniers ’ for a varlet y of reasons, choose to 

, . air | e ^entifier in different declarations. We then have the question of which 

o ject the identifier refers to. Let us give a simple example of this state of affairs. 
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int a = 2; 

printf("%d\n", a); 

{ 

*i nt ~ 5 * 

printf("%d\n", a); 

> 

printf("%d\n", ++a); 


/* outer block a */ 

/* 2 is printed */ 

/* inner block a V 
/* 5 is printed */ 

/* back to the outer block */ 
/* 3 is printed */ 


An equivalent piece of code would be 


int a_outer = 2; 

printf("%d\n", a_outer); 

{ 

int a_inner = 5; 

printf("%d\n", a_inner); 

} 

printf("%d\n", ++a_outer); 

} 

Each block introduces its own nomenclature. An outer block name is valid unless an 
inner block redefines it. If redefined, the outer block name is hidden, or masked, from 
the inner block. Inner blocks may be nested to arbitrary depths that are determined by 
system limitations. The following piece of code illustrates hidden variables in three 

nested blocks: 


int a=l, b=2, c=3; 
printf("%3d%3d%3d\n", a, b, c); 

{ 

int b = 4; 

float c = 5.0' 

printf("%3d%3d%5.lf\n", a, b, c) 
a = b; 

{ 

int c; 

printf("%3d%3d%3d\n", a, b, c); 

} , , 

printf("%3d%3d%5.lf\n", a, b, c) 

> 

printf("%3d%3d%3d\n", a, b, c); 


/* 1 2 3 */ 


/* 1 4 5.0 


/* 4 4 4 */ 

/* 4 4 5.0 */ 

/* 4 2 3 V 


m 


m 


ml 




mm 


-J A 


Ip 


sssAssm.-. 


a 



5.10 ▼ Scope Rules 215 


Xhe i nt variable a is declared in the outer block and is never redeclared. Therefore, it is 
available in both nested inner blocks. The variables b and c are redeclared in the first 
inner block, thus, hiding the outer block variables of the same name. Upon exiting this 
block, both b and c are again available as the outer block variables, with their values 
intact from the outer block initialization. The innermost block again redeclares c, so 
that both c (inner) and c (outer) are hidden by c (innermost). 


if--'' 


""If; parallel and Nested Blocks 


(ggllTWo blocks can come one after another, in which case the second block has no knowl- 
. edge of the variables declared in the first block. Such blocks, residing at the same level, 
MU are called parallel blocks. Functions are declared in parallel at the outermost level. The 
following code illustrates two parallel blocks nested in an outer block: 


i nt 


a, b; 


float 


oat 


/* inner block 1 */ 

/* int a is known, but not int b */ 


/* inner block 2 */ 


/* int b is known, but not int a */ 

/* nothing in inner block 1 in known */ 


9 * • • * 




Parallel and nested blocks can be combined in arbitrarily complicated schemes. The 
chief reason for blocks is to allow memory for variables to be allocated where needed. If 
memory is scarce, block exit will release the storage allocated locally to that block, 
allowing the memory to be used for some other purpose. Also, blocks associate names 
in their neighborhood of use, making the code more readable. Functions can be viewed 
as named blocks with parameters and return statements allowed. 
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Using a Block for Debugging 

One common use for a block is for debugging purposes. Imagine we are in a section o 
code where a variable, say v, is misbehaving. By inserting a block temporarily into th 
code, we can use local variables that do not interfere with the rest of the program. 
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Declarations of variables within blocks are implicitly of storage class automatic. The 
keyword auto can be used to explicitly specify the storage class. An example is 


auto int 
auto float 


a, b, c; 
f; 


{ /* debugging starts here */ 

stsitii c ~i nti cnt — 0" 

printf("*** debug: cnt = %d v = %d\n", ++cnt, v); 




H 


W&Pr* . ’i 

mSm& i&gs. 




3ZXf2$ 


The variable cnt Is local to the block. It will not interfere with another variable of th 
same name in an outer block. Because its storage class is stati c, it retains its old vali 
when the block is reentered. Here, it is being used to count the number oi times tl 
block is executed. (Perhaps we are inside a for loop.) We are assuming that the variable 
v has been declared in an outer block and is therefore known in this block. We ar| 
printing its value for debugging purposes. Later, after we have fixed our code, this 
block becomes extraneous, and we remove it. 



’ Because the storage class is automatic by default, the keyword auto is seldom used. 

When a block is entered, the system allocates memory for the automatic variables, 
pthin that block, these variables are defined and are considered “local" to the block. 
When the block is exited, the system releases the memory that was set aside for the 
automatic variables. Thus, the values of these variables are lost. If the block is reen¬ 
tered, the system once again allocates memory, but previous values are unknown. The 
igbody of a function definition constitutes a block if it contains declarations. If it does, 
then each invocation of the function sets up a new environment. 


■MM- 
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.11 Storage Classes 

Every variable and function in C has two attributes: type and storage class. The for, 
storage classes are automatic, external, register, and static, with corresponding key 
words 


mSsftT/wvr'v''" "w. 





The Storage Class extern 

One method of transmitting information across blocks and functions is to use external 
variables. When a variable is declared outside a function, storage is permanently 
assigned to it, and its storage class is extern. A declaration for an external variable can 
look just the same as a declaration for a variable that occurs inside a function or block. 
Such a variable is considered to bo global to all functions declared after it, and upon 
exit from the block or function, the external variable remains in existence. The follow¬ 
ing program illustrates these ideas: 


: 

» ' •' 1 
5 $%' ? 

3 .- 



#include <stdio.h> 


auto 


extern 


register 


static 


By far the most common storage class for variables is automatic. However, the 
granimer needs to know about all the storage classes. They all have important uses 


The Storage Class auto 
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tj:- 

IP 

SIS 

mmm 


'' % 

' . a; 

mSSs^s 


- <■ 


»%|ff 

r|(,. 

■i wmm 

mmSm 
jpfBp 

t 5 1: 

.1 


int a = 1, b 
int f(void); 


2, c = 3 


/* global variables */ 

/* function prototype */ 


Variables declared within function bodies are automatic by default. Thus, automatic is 
the most common of the four storage classes. If a compound statement starts with vari 
able declarations, then these variables can be acted on within the scope of the enclosing 
compound statement. A compound statement with declarations is called a Nock to dis^ 
tinguish it from one that does not begin with declarations. 
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int main(void) 

{ 

printf("%3d\n", f()); 
printf("%3d%3d%3d\n", a, b, c) 
return 0; 

} 

int f(void) 

{ 

int b, c; 

a = b = c = 4; 
return (a + b + c); 


/* 12 is printed */ 

/* 4 2 3 is printed */ 


/* b and c are local */ 

/* global b, c are masked */ 


-- 
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Note that we could have written 


eg . . ■/■y' MwBl 

... sm mm 

i • Umk 


extern int 


b - 2, c 


/* global variables */ 


i# 

C** n.*** *v ‘ _ 'f~< 

&^§$SB a 
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This use of extern will cause some traditional C compilers to complain. In ANSI C, this! 
use is allowed but not required. Variables defined outside a function have external stor¬ 
age class, even if the keyword extern is not used. Such variables cannot have automatic 
or register storage class. The keyword static can be used, but its use is special, as| 
explained in Section 5.12, “Static External Variables,” on page 221. f 

The keyword extern is used to tell the compiler to “look for it elsewhere, either in 
this file or in some other file.” Let us rewrite the last program to illustrate a typical use 
of the keyword extern. 1 

In file filel.c 


II .0rs*;M3 


&aX*A:**aVI ‘.SS&eSg 


- ? 4, 

v"'; ■ 1 

s* 


MB 


. ■ - ■■ 

w. 


v,. : 
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#include <stdio.h> 


■■Mm. 


iHf " - U 


b Informatl / )n can be Passed into a function two ways: by use of external variables and 

^L^to^dS^ t rm eCh T Sm H Alth ° Ugh th6re are exce P tions < th e use of the 
parameter mechamsm is the preferred method. It tends to improve the modularity of 

the code, and it reduces the possibility of undesirable side effects 

F ° n< A for ™ ot S !f C *f ct occurs when a Unction changes a global variable from within 
its body rather than through its parameter list. Such a construction is error prone Cor 

ect practice is to effect changes to global variables through the parameter and return 
mechanisms. Adhering to this practice improves modularity and readability and 
because changes are localized, programs are typically easier to write and maintain. 

All functions have external storage class. This means that we can use the keyword 
- xte rn 111 function definitions and in function prototypes. For example, ' 

1 extern double sin(double); 

| * functlon P rot °we for the sinC) function; for its function definition we can write 


int a = 1, b 
int f(void); 


2, c 


/* external variables */ 


int main(void) 

{ 

printf("%3d\n", f()); 
printfC^d^Od^GdXn' 1 , a, b, c) ;. 
return 0; 

} 

In file file2.c 
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extern double sin(double x) 
{ 


he Storage Class register 


int f(void) 

{ 

extern int 
i nt 


b, c 


/* look for it elsewhere */ 


a = b = c = 4; 
return (a + b + c); > 

The two files can be compiled separately. The use of extern in the second file tells 
the compiler that the variable a will be defined elsewhere, either in this file or in some 
other. The ability to compile files separately is important when writing large programs 
External variables never disappear. Because they exist throughout the execution 
of the program, they can be used to transmit values across functions. They may, hoW-| 
ever, be hidden if the identifier is redefined. /Another way of conceiving of external vari 
ables is to think of them as being declared in a block that encompasses the whole 
program. 
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he storage class register tells the compiler that the associated variables should be 
ored in high-speed memory registers, provided it is physically and semantically possi- 

is imn 0 ossihfp e T Se t reS0UrC i and semantic constraints sometimes make 

locatnraDDronlTT f de<?aUltS l ° au,oniatic ^cnever the compiler cannot 
locate an approbate physical register. Typically, the compiler has only a few such 

’herw!se aVa ^ ° f ^ for use and cannot be allocated 

.eed^Wh^^n T- ° f St ° rage Cl f S register is an attempt to improve execution 

ost ienrtv^Irr«lrf° nC H r H ’ T pr ° grammer raa V choose a few variables that are 

ndidates for such t^ifm f 0 ' 11 t0 be of storage class register. Common 

. examnle mclude Ioop variables and function parameters. Here is 
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The declaration 

register i; is equivalent to register int i, 

If a storage class is specified in a declaration and the type is absent, then the type is 
i nt by default. 

Note that in our example the register variable t was declared as , cl ° s ® 
use as possible. This is to allow maximum availability of the physical re ® 

them only when needed. Always remember that a register declaration is taken I as 

advice to the compiler. 

The Storage Class static 

Static declarations have two important and distinct uses The more el e“ e ^®ry use is to 
allow a local variable to retain its previous value when the block is reentered 
contrast to ordinary automatic variables, which lose their value upon block exit and 
must be reinitialized. The second and more subtle use is in connection with external 

declarations. We will discuss this in the next section. 

As an example of the value-retention use of stati c, we will write the outline of a 
function that behaves differently depending on how many times it has een ca e . 


IMy,: 


pi 

i sni 

II® 




f 

Bl lp afeg*' 


- ■■■■; 

1 


5.1 2 ▼ Static External Variables 


221 




.. 

Npiiiiiiiiiis 


llli§£ 


] 




ihis 


lis 




§g||§&§ 


■ • - 

v-V-r 


il 


•;C • f- 


-c'/dy-r \\fyy, 

" ®V 

v 

Ax: 




if 

..- •-'*C- m 

.. ,-VA; 


' sv - 


Wmm 


mz-y 


m- 

■pi 
IBfes 


•ir-;..--,' 

ttb; 


mm 


■ 1 ■. - 1 


'.. . •:•_ f.<- - “'-yyi b 
v -'Tt' 

Jpi 
;- .• 

ii '? WMMi : V 

$1 

|| r- 

' ’-r‘ 

ffimM .t v^‘ 

i ■ .j 

fe®®® 1 




y:;.r 






void f(void) 

{ 

static int 

++cnt; 
if (cnt % 2 
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else 


/* do something */ 

/* do something different w / 
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The first time the function is invoked, the variable cnt is initialized to zero. On func 
tion exit, the value of cnt is preserved in memory. Whenever the funedon is mv^ed 
again, cnt is not reinitialized. Instead, it retains its previous value from ‘ 
the function was called. The declaration of cnt as a static !nt mside of f O 1 P ; 
private to f Q. If it were declared outside of the function, then other functions could 

access it, too. 
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5.12 Static External Variables 

The second and more subtle use of stati c is in connection with external declarations. 
With externa] constructs it provides a “privacy” mechanism that is very important for 
program modularity. By privacy, we mean visibility or scope restrictions on otherwise 
accessible variables or functions. 

At first glance, static external variables seem unnecessary. External variables already 
retain their values across block and function exit. The difference is that static external 
variables are scope-restricted external variables. The scope is the remainder of the 
source file in which they are declared. Thus, they are unavailable to functions defined 
earlier in the file or to functions defined in other files, even if these functions attempt 
to use the extern storage class keyword. 


void f(void) 
{ 


static int 

void g(void) 
{ 


/* v is not available here */ 


/* static external variable */ 


/* v can be used here */ 


Let us use this facility to provide a variable that is global to a family of functions but, 
at the same time, is private to the file. We will write two pseudo random-number gener¬ 
ators, both of which use the same seed. (The algorithm is based on linear congruential 
methods; see The Art of Computer Programming, 2d ed., vol. 2, Seminumerical Algo¬ 
rithms, by Donald Ervin Knuth (Reading, MA: Addison-Wesley, 1981.) 

/* A family of pseudo random number generators. */ 


#define 
#define 
#define 
#define 
#define 


INITIAL-SEED 

MULTIPLIER 

INCREMENT 

MODULUS 

FLOATING_MODULUS 


17 

25173 

13849 

65536 

65536.0 


static unsigned seed = INITIAI—SEED; /* external, but */ 

/* private to this file */ 
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unsigned random(void) 

* seed = (MULTIPLIER * seed + INCREMENT) % MODULUS, 
return seed; 

} 

double probability(void) 

{ seed - (MULTIPLIER * seed + INCREMENT) % MODULUS; 
return (seed / FLOATING_MODULUS), 

} 

. a a«narentlv random sequence of integer values 

The function randomO produces an a\ p y produces an apparently random 
between 0 and MODULUS. The function probabt 11 ty() produces an app 

sequence oi floating values between 0 and 1. , w value of the vari- 

„ h™ Art a rail m randomO or probabilityO produces a new \aiue oi u 
Notice that a call to ranaomu oi pi is a static external variable, it is 

able seed that depends on its old valu . function calls. We can now create 

private to this file, and its value is preserve e generators without worrying 

functions in other files that invoke these random-number generators w 

about side effects. for function definitions and proto- 

ty ,r™ — = T*. ~ 

visible only within the file In whlcli *ey throughout its own file, 

hut £ Sin developing private — of function del- 

initions. 


static int g(void); 

void f(int a) 

{ 


/* function prototype */ 

/* function definition */ 

/* g() is available here, *, 
/* but not in other files * 


static int g(void) 


/* function definition */ 
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5,13 Default Initialization 

In C, both external variables and static variables that are not explicitly initialized by the 
programmer are initialized to zero by the system. This includes arrays, strings, point¬ 
ers, structures, and unions. For arrays and strings, this means that each element is ini¬ 
tialized to zero; for structures and unions, it means that each member is initialized to 
zero. In contrast to this, automatic and register variables usually are not initialized by 
the system. This means they start with “garbage” values. Although some C systems do 
initialize automatic variables to zero, this feature should not be relied on; doing so 
makes the code nonportable. 


5.14 Recursion 

A function is said to be recursive if it calls itself, either directly or indirectly. In C, all 
functions can be used recursively. In its simplest form, the idea of recursion is straight¬ 
forward. Try the following program: 

#include <stdio.h> 

int main(void) 

{ 

printf(" The universe is never ending! "); 
main(); 
return 0; 

} 

Here is another example of a recursive function. It computes the sum of the first n pos- 
itive integers. 


int sum(int n) 

{ 

if (n <= 1) 
return n; 
el se 

return (n + sum(n 


l)) 
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The recursive function sum() is analyzed as illustrated in the following table. The base 
case is considered, then working out from the base case, the other cases are considered. 


P8S 

fey:- 



Function call 

Value returned 



sum(l) 

1 



sum(2) 

2 + sum(l) 

or 

2 + 1 

sum(3) 

3 + sum(2) 

or 

3 + 2 + 1 

sum(4) 

4 + sum(3) 

or 

A jl, 3 4 * ? x 1 

nr ~ c* jl 


Simple recursive routines follow a standard pattern. Typically, there is a base case (or 
cases) that is tested for upon entry to the function. Then there is a general recursive 
case in which one of the variables, often an integer, is passed as an argument in such a 
way as to ultimately lead to the base case. In sum(), the variable n was reduced by 1 
each time until the base case with n equal to 1 was reached. 

Let us write a few more recursive functions to practice this technique. For a nonnega¬ 
tive integer n, the factorial of n, written n !, is defined by 



logical operations in the body of the function are beyond the integer precision available 
to til6 system. 

M As m sum(), when an integer n is passed to factori al (), the recursion activates n 
II neste copies of the function before returning level by level to the original call This 
means that n function calls are used for this computation. Most simple recursive func¬ 
tions can be easily rewritten as iterative functions. 


/* iterative version */ 


0 ! = 1 , 
or equivalently, 

0 != 1 , 


n ! = n(n - 1) ■•■ 3 ■ 2 • 1 


for n > 0 


' ^ All 
: % 



n != n((n - 1)!) 


for n > 0 


Thus, for example, 5! = 5 ■ 4 • 3 • 2 • 1 = 120. Using the recursive definition of facto¬ 
rial, it is easy to write a recursive version of the factorial function. 

int factorial(int n) /* recursive version */ 

{ 

if (n <= 1) 
return 1; 
else 

return (n * factorial(n - 1)); 

} V 

This code is correct and will work properly within the limits of integer precision avail¬ 
able on a given system. However, because the numbers n ! grow large very fast, the 
function call factorial (n) yields a valid result for only a few values of n. On our sys¬ 
tem the function call factori al (12) returns a correct value, but if the argument to the 
function is greater than 12, an incorrect value is returned. This type of programming 
error is common. Functions that are logically correct can return incorrect values if the 
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int factorial(int n) 

{ 

int product = 1; 

for ( ; n > 1; —n) 
product *= n; 
return product; 


For a given input value, both factorial functions return the same value over the positive 

integers, but the iterative version requires only one function call regardless of the value 
passed in. 

TJ 16 next example illustrates a recursive function that manipulates characters. It can 

jg|| easily be rewritten as an equivalent iterative function. We leave tills as an exercise. The 

program Irst prompts the user to type in a line. Then by means of a recursive function 
| it writes the line backwards. 

jg /* Write a line backwards. */ 

Iv #include <stdio.h> 
gl void wrt_it(void); 
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int main(void) 

{ 

printf("Input a line 
wrt_it(); 
printf("\n\n"); 
return 0; 


void wrt_it(void) 

{ 

int c: 










pli. „... 


if ((c = getcharO) != ’\n') 
wrt_it(); 
putchar(c); 
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_ _ nn nf xoence when prompted, then the fol- 
If the user types in the line si ng a song P 

lowing appears on the screen. 

Input a line: sing a song of sixpence 
ecnepxis fo gnos a gnis 


x" i 




mm 


yfft: 


Efficiency Considerations 







mmzm i 




m 

'■XZZii 




§§£§ 


Si 


Dissection of the wrLbkwrds Program 


B printf("Input a line: "); 

wrt_it(); . 

• * a A ThP ncpr writes a line and terminates it by typing® 
First, a prompt to the user is printed. recursive function wrt_i t O isfj 

a carriage return, which is the character \n. then the recursi jg| 

invoked. 

m void wrt_it(void) Jjj 

{ 11 

int c; fgg 

This function has no test for EOF; thus, c could be of type char. jjj 

■ if ((c = getcharO) '•= ’\n') j§ 

wrt_it(); ;■ 

, . . t n p W ii ne then wrt it() is invoked again, which causjji 
If the character read m is not a newlme.th ^ has itg own loca l storage for thegl 

another character to be read in, a ^d f or ‘ h function wrt_it(), and each 

variable c. Each c is local to a parhcuiai invocauon ot are stacked until M 

c stores one of the characters m the input stream, me tunci .|| 

the newline character is read. 


Spfj Many algorithms have both iterative and recursive formulations. Typically, recursion is 
ly «mrp elegant and requires fewer variables to make the same calculation. Recursion 
§f|§sj. takes care of its bookkeeping by stacking arguments and variables for each invocation. 
Sjjte 'rhk stacking of arguments, while invisible to the user, is still costly in time and space. 
jjjpjlltOn some machines a simple recursive call with one integer argument can require eight 
32-bit words on the stack. 

Let us discuss efficiency with respect to the calculation of the Fibonacci sequence, a 

jig™gjg'&v ’ 

Baldfe . particularly egregious example. This sequence is defined recursively by 

|#gp§r 


ffi — 0 


fl 


/i+1 = fi + fi-1 


for i = 1. 2 


Except for fo and f), every element in the sequence is the sum of the previous two ele- 
Bjte mpnts. The sequence begins 0, 1, 1, 2, 3, 5, ... . Here is a function that computes 
Sj^feFibonacci numbers recursively: 

^§mim 

int fibonacci(int n) 

{ 

if (n <= 1) 
return n; 
el se 

return (fibonacci (n 



1) + fibonacci(n - 2)); 


} 


jijy As the following table shows, a large number of function calls is required to compute 
Hpfhe nth Fibonacci number for even moderate values of n : 




■ putchar(c); _ 

Only after the newline character is read does 

wrt_itO now prints the value stored m its toe^ first character* 
is printed, then the character just before it, and so on, until tn if 1 

printed. Thus, the input line is reversed. 



mm 






Value of n 

Value of 
fibonacci(n) 

Number of function calls required to 
recursively compute fibonacci(n) 

0 

0 

1 

1 

1 

1 

2 

1 

3 

• » * * • 

23 

28657 

92735 

24 

46368 

150049 

42 

267914296 

866988873 

43 

433494437 

1402817465 
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Although it is seductive to use recursion, one must be careful about run-time limita¬ 
tions and inefficiencies. It is sometimes necessary to recode a recursion as an equiva¬ 
lent iteration. 

Some programmers feel that because the use of recursion is inefficient, it should not 
be used. The inefficiencies, however, are often of little consequence—as in the example 
of the quicksort algorithm given in Section 8.5, “An Example: Sorting with qsortO," on 
page 372. For many applications, recursive code is easier to write, understand, and 
maintain. These reasons often prescribe its use. 


5.1 5 An Example: The Towers of Hanoi 
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In the game called Towers of Hanoi, there are three towers labeled A, B, and C. The 
game starts with n disks on tower A. For simplicity, suppose n is 3. The disks are num¬ 
bered from 1 to 3, and without loss of generality we may assume that the diameter of 
each disk is the same as its number. That is, disk 1 has diameter 1 (in some unit of mea¬ 
sure), disk 2 has diameter 2, and disk 3 has diameter 3. All three disks start on tower A 
in the order 1, 2, 3. The object of the game is to move all the disks on tower A to tower 
C. Only one disk may be moved at a time. Any of the towers A, B, or C may be used for 
the intermediate placement of disks. After each move, the disks on each of the towers 
must be in order. That is, at no time can a larger disk be placed on a smaller disk. 


Towers of Hanoi 
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VVe will write a program that uses recursion to produce a list of moves that shows 
how to accomplish the task of transferring the n disks from tower A to tower C. Here is 
the program: 

In file hanoi.h 

#include <assert.h> 

#include <stdio.h> 

#include <stdlib.h> 

extern int ent; /* count of the number of moves */ 

int get_n_from_user(void); 

void move(int n, char a, char b, char c); 

In file main.c 


#include "hanoi.h" 


i nt 


ent = 0 


/* count of the number of moves */ 


int main(void) 

{ 

int n; 


} 


/* number of disks */ 


n = get_n_from_user(); 
assert(n > 0); 

/* 

// Move n disks from tower A to tower C, 

// using tower B as an intermediate tower. 

V 


move(n, 'A' 
return 0: 


1 D f 


B 


'C'); 


/* recursive fet */ 
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In file get.c 

#include "hanoi.h" 

int get_n_from_user(void) 

{ 

int n: 


printf("%s", 

"—\n" 

"TOWERS OF HANOI:\n" 

"\n" 

"There are three towers: A, B, and C.\n" 

"\n" 

"The disks on tower A must be moved to tower C. Only one\n 
"disk can be moved at a time, and the order on each tower\n 
"must be preserved at each step. Any of the towers A, B,\n 
"or C can be used for intermediate placement of a disk.Vn" 
"\n" 

"The problem starts with n disks on Tower A.\n" 

"\n" 

"Input n: "); 

if (scanf("%d", &n) != 1 || n < 1) { 

printf("\nERROR: Positive integer not found - bye!\n\n"); 
exit(l); 

} 

printf("\n"); 
return n; 


In file move.c 


#include "hanoi.h" 

void movefint n, char a, char b, char c) 

{ 

if (n == i) { 

++cnt; 

printf("%5d: %s%d%s%c%s%c.\n", cnt, 

"Move disk ", 1, " from tower ", a, " to tower ", c); 

} 

else { 

move(n - 1, a, c, b); 

I * 

* l V 1 » V* J 

printf("%5d: %s%d%s%c%s%c.\n", cnt, 

"Move disk ", n, " from tower ", a, " to tower ", c); 
move(n - 1, b, a, c); 
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In our header file, we declare the variable cnt to be of storage class extern, which 
means “look for it here or in some other file,” and we provide two function prototypes 
at the bottom of the file. In main.c , just above the function definition for mai n(), we 
declare the external variable cnt, and inside of mai n(), we first get n from the user and 
then call the recursive function 



move(n 


t A I 


t n t 


’C) 


We read this as “move n disks from tower A to tower C, using tower B as an intermedi¬ 
ate tower.” Alternatively, we can think of this as “move n disks from tower A through 
tower B to tower C.” We want to explain in detail how the function nioveQ works, but 
before we do so, here is the output of the program if we enter 3 when prompted: 


TOWERS OF HANOI: 


There are three towers: A, B, and C. 


The disks on tower A must be moved to tower C. Only one 
disk can be moved at a time, and the order on each tower 
must be preserved at each step. Any of the towers A, B, 
or C can be used for intermediate placement of a disk. 


Wei 


The problem starts with n disks on Tower A 


Input n: 3 


1: Move 
2: Move 
3: Move 
4: Move 
5: Move 
6: Move 
: Move 


di sk 
di sk 
disk 
di sk 
disk 
di sk 
disk 


1 from 

2 from 
1 from 

3 from 

1 from 

2 from 
1 from 


tower 

tower 

tower 

tower 

tower 

tower 

tower 


A to 
A to 


C to 
A to 


B to 
B to 


A to 


tower C. 
tower B. 
tower B. 
tower C. 
tower A. 
tower C. 
tower C. 
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Dissection of the recursive function move() 

a void move(int n, char a, char b, char c) 

This is the header to the function definition for moveQ . In main() , where we first call 
moveO, the arguments passed to the function are n, ‘A’, ‘B \ and C . thus, m this 
first call, the parameters a, b, and c in the function definition have t te \ a ues > » *; 

and 1 C’, respectively. Note, however, that when moveO calls itself withm the 0 Y of 

move (), this association need not hold. 

a if (n == 1) { f 

++cnt; ! 

printf("%5d: %s%d%s%c%s%c.\n", cnt, | 

"Move disk ", 1, " from tower ", a, to tower , c), 

} | 

If the number of disks on tower a is 1, then we increment the counter and then move g 
the disk from tower a to tower c. That is, we print the instructions to do so. |j 


1: move disk 1: 
2: move disk 2: 
3: move disk 1: 
4: move disk 3: 
5: move disk 1: 
6: move disk 2: 
7: move disk 1: 


2 

1 2 
1 2 


2 3 
12 3 


To save space, the disks on each tower are written from left to right instead of verti¬ 
cally. From this output, w ; e can tell at a glance that a larger disk is never placed on a 
smaller one. 

According to legend, each day one of the monks in the monastery that attends the 
three towers of Hanoi moves one of the disks. At the beginning of time, tower A started 
with 64 disks. When the last disk gets moved, according to legend, the world will come 
to an end. 


else { 

move(n - 1, a, c, b); 

++cnt; 

printf("%5d: %s%d%s%c%s%c.\n", cnt, 
"Move disk ", n, " from tower ", a, 
move(n - 1, b, a, c); 
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to tower 


c); 


When there are n disks on tower a with n greater than 1, we first move n - 1 disks; jj 
from tower a to tower b, using tower c as an intermediate tower. Secon , we mcremen |jj 
the counter and move the last disk on tower a, disk n, from tower a to tower c. a 1S) 
we print the instructions for doing this. Finally, we move the n - 1 disks on tower bjj 

tower c, using tower a as an intermediate tower. jjj 
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The code for the move() function is elementary, but strikingly effective. However, tog 
actually see that the output provides valid instructions for solving the game, one nee s .. 
to draw pictures that illustrate each move. This is, indeed, tedious to do. Later, alter w|g 
have learned about the use of arrays, we can rewrite our program so that the ou pu 
illustrates the disks on each tower at each step. Here is the output from the programg 

when n has the value 3: ; 
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Summary 

1 Functions are the most general structuring concept in C. They should be used to 
implement “top-down” problem solving—namely, breaking up a problem into 
smaller and smaller subproblems until each piece is readily expressed in code. 

2 A return statement terminates the execution of a function and passes control back 
to the calling environment. If the return statement contains an expression, then 
the value of the expression is passed back to the calling environment as well. 

3 A function prototype tells the compiler the number and type of arguments that are 
to be passed to the function. The general form of a function prototype is 

type function_name ( parameter type list) ; 

The parameter type list is typically a comma-separated list of types, with optional 
identifiers. If a function takes no arguments, then the keyword void is used. If the 
function returns no value, then the function type is void. 

4 Arguments to functions are passed by value. They must be type compatible with 
the corresponding parameter types given by the function definition or the function 
prototype. 
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5 The storage class of a function is always extern, and its type specifier is i nt unless 
otherwise explicitly declared. The return statement must return a value compati¬ 
ble with the function type. 

6 The principal storage class is automatic. Automatic variables appear and disappear 
with block entry and exit. They can be hidden when an inner block redeclares an 

outer block identifier. 

7 Scope rules are the visibility constraints associated with identifiers, for example, if 
in a file we have 

static void f(void) 

{ 


static int a, b, c; 


then f () will be known throughout this file but in no other, and a, b, and c will be 
known only in this file and only below the place where they are declared. 

8 The external storage class is the default class for all functions and all variables 
declared outside of functions. These identifiers may be used throughout the pro¬ 
gram. Such identifiers can be hidden by redeclaration, but their values cannot be 

destroyed. 

9 The keyword extern is used to tell the compiler to “look tor it elsewhere, either in 
this file or in some other file.” 

10 The storage class regi ster can be used to try to improve execution speed. It is 
semantically equivalent to automatic. 

11 The storage class stati c is used to preserve exit values of variables. It is also used 
to restrict the scope of external identifiers. This latter use enhances modularization 
and program security by providing a form of privacy to functions and variables. 

12 External and static variables that are not explicitly initialized by the programmer 
are initialized to zero by the system. 

13 A function is said to be recursive if it calls itself, either directly or indirectly. In C, 
all functions can be used recursively. 



Exercises 

1 Write a function double power (double x, int n) that will compute x”, the nth 
power of x. Check to see that it computes 3.S 7 correctly. (The answer is 
6433.9296875.) 

2 Use the library function sqrtQ to write a function that returns the fourth root of 
its int argument k. The value returned should be a double. Use your function to 
make a table of values. 

3 What gets printed? Explain. 

#include <stdio.h> 

int z; 

void f(int x) 

{ 

x = 2; 

Z JL — V * 

} 

int main(void) 

{ 

_ U « 

J j 

f (z); 

printf("z = %d\n", z); 
return 0; 

} 

4 In traditional C, the general form of a function definition is 


type function_name{ parameter list ) 
declarations of the parameters 
{ declarations statements } 

See Section 5.1, “Function Definition,” on page 198, for an example. Rewrite the 
function definition for f () in the previous exercise in this style. Check to see that 
your compiler will accept it. Does the program produce the same output? (It 
should.) 
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5 In this exercise, we want to experiment with function declarations. We will use the 
function pow() from the mathematics library, but instead of including math.h, 
will supply our own function declaration. Begin by executing the following program, gj|| 


#include <stdio.h> 

double powfdouble, double) 






ifSI ' 


int main(void) 

{ 

printf("pow(2.0, 3.0) 
return 0; 

} 




%g\n", pow(2.0, 3.0)); 


ill 




int main(void) 

{ 

int a, b, cnt = 0, i; 

srand(time(NULL)); 
for (i = 0; i < 1000; ++i) { 
a = randQ % 3 + 1; 
b = randQ % 30 + 1; 
if (b - a <= 1) 
continue; 

assert(b - a > 2); 
printf("%3d\n", ++cnt); 


/* from 1 to 3 */ 
/* from 1 to 30 */ 


} 

return 0; 


m 
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Next, change the pri ntf () statement to 

printf("pow(2, 3) = %g\n", pow(2, 3)); 

and run the program again. Even though you have passed i nts to pow() instead o| 
doubl es, do you get the correct answer? You should, because the compiler, know¬ 
ing that double arguments are required, is able to coerce the values supplied mto 
the right form. Next, replace the function prototype by “ 


mm ip^ 

--- ~r, 

- 


} 




double pow() 


/■■< traditional style */ 


What happens now? Finally, remove the function declaration completely. Although.^ 
your program cannot be expected to run correctly, it should compi e. oes l . ls t * 
important to remember that even though the C system provides object code f<§ 
functions in libraries, it is the responsibility of the programmer to provide correc^ 

function prototypes. |I 

6 In this exercise we want to experiment with an assertion. Execute the following pro¬ 
gram so that you understand its effects. Note: This is not a typical use o assertion* 

#include <assert.h> |f 

#inclucle <stdio.h> il 

#include <stdlib.h> k 

#include <time.h> 




What values of a and b cause the assertion to fail? On average, how many times do 
you expect to go through the loop? 

Use the function probabilityO that we wrote in Section 5.12, “Static External 
Variables,” on page 222, to create two files, one with 100 random numbers and the 
other with 1,000 random numbers. All the numbers will lie in the range from 0 to 1. 
If the numbers are truly randomly distributed, then we expect the average to 
approximate 0.5 as we look at larger and larger sets of them. Compute the average 
of the numbers in each of your files. Typically, the average of the numbers in the 
larger file will be closer to 0.5 than the average of the numbers in the smaller file. Is 
it? 

A polynomial of degree 2 in x is given by 
ax 2 + bx + c 

Write the code for the function 

double f(double a, double b, double c, double x) 

{ 


that will compute the value of an arbitrary polynomial of degree 2. Use the identity 
(ax + b)x + c = ax 2 + bx + c 

to minimize multiplications. Use your function to create a file of points that could 
be used to plot the polynomial x 2 - 3x+ 2 on the interval [0, 3], The variable x 
should go from 0 to 3 in steps of 0.1. By examining your file, can you tell where the 
roots of the polynomial are? 
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9 Experiment with the tbLof_powers program presented in Section 5.4, “An Example: 
Creating a Table of Powers,” on page 203. How many rows of the table can you com- 
pute before the powers that are printed become incorrect? Where appropriate, try 
using the type doubl e. Can you get larger numbers this way? 


10 In the tbLof.'.powers program in Section 5.4, “An Example: Creating a Table of Pow¬ 

ers,” on page 203, we wrote the function definition for mainQ first. What happens^ 
if you remove the function prototypes and put main() last? Your compiler should^ 
be happy with this new arrangement. Is it? What happens if you put mai n() first# 
but do not include any function prototypes? Can you guess what your compiler will* 
complain about? One of our C compilers issues warnings but produces an execut-f 
able file, whereas our C++ compiler issues errors and refuses to compile the code.pi 
What happens on your system? 4^ 

11 Write a function i nt i s_prime(i nt n) that returns 1 if n is a prime and 0 otherJi 

wise. Hint: If k and n are positive integers, then k divides n if and only if n % k ha|fa* 
value zero. IS 

•; • 

12 Write a function, say i s_fi b_prime(), that checks whether the nth Fibonacci num¬ 

ber is prime. Your function should call two other functions: the iterative version ofggj 
fi bonacci Q that was given in Section 4.13, “An Example: Fibonacci Numbers," ofg® 
page 175, and the function i s_pri me() that you wrote in the previous exercise. F<j|gj| 
n between 3 and 10, it is true that the nth Fibonacci number is prime if and only if iijj 
is prime. Use your function i s_fi b„prime() to investigate what happens when nisp 
bigger than 10. vj 

13 A famous conjecture, called the Goldbach. conjecture, says that every even integer 

greater than 2 has the property that it is the sum of two prime numbers. Compute^ 
have been used extensively to test this conjecture. No counterexample has evjggj 
been found. Write a program that will prove that the conjecture is true for all thSj 
even integers between the symbolic constants START and FINISH. For example, if 
you write 'SUf 3 


#define 
#define 


START 

FINISH 


700 

1100 


▼ Exercises 239 


then the output of your program might look like this: 


Every even number greater than 2 is the sum of two primes: 

700 = 17 + 683 
702 = 11 + 691 

704 = 3 + 701 

***** 

1098 = 5 + 1093 

1100 = 3 + 1097 


Hint: Use the function i s_prime() that you wrote in exercise 11, on page 238. 


Write a function that finds all factors of any particular number. For example, 


9 = 3x3, 17 = 17 (prime), 52 = 2 x 2 x 13 

Factoring small numbers is easy, and you should have no trouble writing a program 
to do this. Note, however, that factoring in general is very difficult. If an integer 
consists of a few hundred digits, then it may be impossible to factor it, even with a 
very large computer. 

This exercise gives you practice on understanding the scope of identifiers. What 
gets printed by the following code? First, hand simulate the code and record your 
answers. Then write a program to check your answers. 

int a = 1, b = 2, c = 3; 
a -j-= b +— *h~c ; 

printf("%5d%5d%5d\n", a, b, c); 

{ 

float b = 4.0; 
int c; 

a += c = 5 * b; 

printf("%5d%5.1f%5d\n", a, b, c); 

} 

pri ntf("%5d%5d%5d\n", a, b, c); 

Rewrite “The universe is never ending!” recursion so that it terminates after 17 
calls. Your program should consist of a single inain() function that calls itself 
recursively. Hint: Use a static variable as a counter. 
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] 7 The following function produces incorrect values on some systems: 

int factorial(int n) /* wrong */ 

{ 

if (n == 0 || n == 1) 
return 1; 
el se 

return (n * factorial(—n)); 

} 

Test the function on your system. Explain why the values produced by the function 
are system-dependent, 

18 The greatest common divisor of two positive integers is the largest integer that is a 
divisor of both of them, For example, 3 is the greatest common divisor of 6 and 15, 
and 1 is the greatest common divisor of 15 and 22. Here is a recursive function that 
computes the greatest common divisor of two positive integers: 

int gcdfint p, int q) 

{ 

int r; 

if ((r = p % q) == 0) 
return q; 
el se 

return gcd(q, r); 


First, write a program to test the function. Then write and test an equivalent itera¬ 
tive function. 

19 in some systems the keyword extern is used in function declarations and proto¬ 
types in the standard header files. This was a common practice in traditional C sys¬ 
tems, but it is usually not done in ANSI C systems. Is this done in your system? 
Hint: Look, for example, in math.h. 
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20 In the program that follows, we have purposely declared some external variables at 
the bottom of the file. What gets printed? 

#include <stdio.h> 

int main (void) 

{ 

extern int a, b, c; /* look for them elsewhere */ 

printf("%3d%3d%3d\n" , a, b, c); 
return 0; 

> 

int a = 1, b = 2, c=3; 

Now change the last line of the program to 
static int a = 1, b = 2, c = 3; 

The variables at the bottom of the file are now static external, so they should not be 
available in main(). Because the external variables referred to in main() are not 
available, your compiler should complain. Docs it? (We find that most compilers get 
confused.) 

21 When declaring a variable in traditional C, it is permissible to write the storage class 
specifier and the type specifier in any order. For example, we can write 

register int i; or int register i; 

In ANSI C, the storage class specifier is supposed to come first. Nonetheless, most 
ANSI C compilers will accept either order. (If they did not, then some traditional 
code would not compile.) Check to see if the reverse order works on your compiler. 

22 Describe the behavior of the following program: 

#include <stdio.h> 

#include <stdlib.h> 

#define FOREVER 1 
#define STOP 17 
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int main(void) | 

{ , I 

void f(void); 

while (FOREVER) 

f(); I 

return 0; I 

} 


void f(void) 

{ 

static int cnt = 0; 
printf("cnt = %d\n", ++cnt); 
if (cnt == STOP) 
exit(0); 

} 

23 Let be a given positive integer. For i =0, 1, 2, ...define 

n i+1 = n ( -/2 if ^ is even; 3n +1 if n, is odd; 

The sequence stops whenever has the value 1. Numbers that are generated this 
way are called “hailstones.” Write a program that generates some hailstones. The 

function 


void hailstones(int n) 

{ 

» i * « I 


should be used to compute and print the sequence generated by n. The output of 
your program might look as follows: 


Hailstones generated by 77: 


77 

232 

116 

44 

22 

11 

26 

13 

40 

16 

8 

A 


Number of hailstones generated: 23 


58 

34 

20 

2 


29 

17 

10 

1 


88 

52 

5 


You will find that all the sequences you generate are finite. Whether this is true in 
general is still an open question. 
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24 Write a coin-tossing program that uses the random-number generator rand() in 
the standard library to simulate the toss. Use the expression randQ % 2 to gener¬ 
ate the int value 0 or 1. Let “heads” be represented by 1 and “tails” by 0. Run the 
program for 1,000, 10,000, and 100,000 tosses, and keep track of the longest 
sequence of heads and the longest sequence of alternating heads and tails—that is, 
101010.... If you know some probability theory, see if this simulation is in line with 
theory. Caution: In some traditional C systems, the function rand() is notoriously 
bad. All you will get is one long sequence of alternating heads and tails. If your 
ANSI C system has borrowed that particular function, then this exercise, although 
informative, will not be too interesting. 

25 If a random-number generator other than rand() is available to you, use it to redo 
the previous exercise. Are the results of your experiments substantially different? 

26 Simulations that involve the repeated use of a random-number generator to repro¬ 
duce a probabilistic event are called Monte Carlo simulations, so called because 
Monte Carlo has one of the world’s most famous gaming casinos. Let us use this 
technique to find the break-even point in a variation of the birthday game. (See 
exercise 36, on page 324 in Chapter 6, “Arrays, Pointers, and Strings.”) In a roomful 
of people, at least two of them can be expected to have birthdays on the same day 
of the year. A common parly game is to see if this is true. We wish to find the prob¬ 
ability that any two people in a room with n people will have been born in the same 
month. (To do the analysis for the same day rather than the same month requires 
the use of an array; otherwise, the ideas arc the same.) It is clear that the probability 
is 0 if there is only one person in the room. For two people the probability is 1/12. 

;g (We are assuming that it is equally likely for a person to be born in any month. In 
reality this is only an approximation.) Simulate the probability by running, say, 
1,000 trials with 2, 3,... , 20 people in the room. (Is 20 too many?) Use 12 variables 
to count the number of people in the room that were born in each of the 12 months. 
You can use the expression 

rand() % 12 + 1 

If to compute the month of birth for each person. (Use a better random-number gen¬ 
erator, such as 1 rand48(), if one is available to you.) This expression generates an 
integer that is randomly distributed in the range from 1 to 12. When any variable 

s gets to 2, that trial is true, meaning that at least two people in the room were born 
in the same month. The number of true trials divided by 1,000 is the computed sim¬ 
ulated probability. What value of n yields the break-even point? That is, find the 
least n for which the probability is 1/2 or more. 
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27 Experiment with the Towers of Hanoi program^How mam >j'j moved 

solve a game .har starts with u disks on ltower A? 1I ™ t0 t „ we r C? ,f 

each day, how many billions of years wi i ' disk each second, 

the monks attending the towers of Hanot manage » move 
will they complete all moves before the end of the w 
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Arrays, Pointers, and Strings 


An array is a data type that uses subscripted variables and makes possible the repre¬ 
sentation of a large number of homogeneous values. In C, arrays and pointers are 
closely related concepts. An array name by itself is treated as a constant pointer, and 
pointers, like arrays, can be subscripted. A distinguishing characteristic of C is its 
sophisticated use of pointers and pointer arithmetic. Some languages provide “call-by¬ 
reference” so that variables in the calling environment can be changed. In C, pointers 
are used as parameters in function definitions to obtain the effect of “call-by-refer¬ 
ence.” Strings are one-dimensional arrays of characters. They are sufficiently important 
to be treated as a special topic. 


6.1 One-dimensional Arrays 

Programs often use homogeneous data. For example, if we want to manipulate some 
rades, we might declare 

|, int gradeO, gradel, grade2; 

If the number of grades is large, representing and manipulating the data by means of 
unique identifiers will be cumbersome. Instead, an array, which is a derived type, can be 
sed. An array can be thought of as a simple variable with an index, or subscript, 
added. The brackets [] are used to contain the array subscripts. To use grade[0], 
grade[1], and grade[2] in a program, we would declare 

int grade [3]; 
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The integer 3 in the declaration represents the number of elements m the array. The 
indexing of array elements always startsJ at a an identlfier wilh a brack- 

A one-dimensional array declaration xs a type^ ^ must be positive, 

eted constant integral expression. elements in the array. The array sub- 

is the size of the array. It specifies the number subscripts is 0 and 

serin ts can range from 0 to size - 1. I he lower bound oi r 

the upper bound is size - 1. Thus, the following relationships o c. 






int a [size]; /* space for a[0], 


« * > 


a [size - 1] allocated */ 


lower bound = 0 
upper bound = size - 1 
size = upper bound + 1 

It is good programming practice to 


define the size of an array as a symbolic constant. 
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This initializes f [0] to 0.0, f [1] to 1.0, and so on. When a list of initializers is shorter 
than the number of array elements to be initialized, the remaining elements are initial¬ 
ized to zero. For example, 

int a[100] = {0}; 

initializes all the elements of a to zero. If an external or static array is not initialized 
explicitly, then the system initializes all elements to zero by default. In contrast, auto¬ 
matic arrays are not necessarily initialized by the system. Although some compilers do 
this, others do not. Programmers should assume that uninitialized automatic arrays 
start with “garbage” values. 

If an array is declared without a size and is initialized to a series of values, it is 
implicitly given the size of the number of initializers. Thus, 


BRIllPw 

W'-'-ii-’Z'. A ■ 

r-j&tJ : »•- fw? b, l d£33 K 


#define 


100 


a[99] is allocated */ 


int a[N]; /* space for a[0].aLSW is «u — ' 

Given this declaration, the standard programming idiom for processing array elements 1 
is with a for loop. For example: 

f ° l 'sum += 0 a[i]l ++l) /* Process element a[i] 

This iteration starts with ft. element .[•] ami 

turn. Because the termination condition is i ’ rw 1 i 

end of the array. The last element processed is, correctly, a [ 



int a[] = {2, 3, 5, -7}; 


and 


int a[4] = {2, 3, 5, -7}; 


are equivalent declarations. This feature works with character arrays as well. However, 
for character arrays an alternate notation is available. A declaration such as 


char s[] 


abc" 


is taken by the compiler to be equivalent to 


f^T ILI • ^ 


\ 0 '} 


Pr, - 

K2&£ Mteim 


Initialization j 

Arrays may be of storage class automatic, " 

tional C, only external and static arrays can be initialized using an y 

ANSI C, automatic arrays also can be initialized. 

one-dimensionaLarrtw initializer ::= { initializerJist } 
initializerJisl : := initializer { , initializei s&+ 
initializer ::= constantJntegraLexpression 



char s[] = {'a 


Subscripting 


If a is an array, we can write a [ expr] , where expr is an integral expression, to access an 
element of the array. We call expr a subscript, or index, of a. Let us assume that the dec- 
aration 


t nt 


a[N] 


Consider the example 
float f [ 5] = 


ifTOfcw mmm 
ice 'L ... , * * 


{0.0, 1.0, 2.0, 3.0, 4.0}; 



as been made, where N is a symbolic constant. The expression a [i ] can be made to 
efer to any element of the array by assignment of an appropriate value to the subscript 
A single array element a[i ] is accessed when i has a value greater than or equal to 0 
id less than or equal to N - 1. If i has a value outside this range, a run-time error will 
:cur when a[i] is accessed. Overrunning the bounds of an array is a common pro- 
amming error. The effect of the error is system-dependent, and can be quite confus- 
g. One frequent result is that the value of some unrelated variable will be returned or 
odified. It is the programmer’s job to ensure that all subscripts to arrays stay within 
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6,2 ▼ Pointers 


g'iiv->- v = 


Recall that a parenthesis pair () following an identifier tells the compiler that thejp 
identifier is a function name. Examples of this are mai n () and f (a, b). In C, the parerhgt 
thesis pair that follows an identifier is treated as an operator. In a similar fashion, thefj 
bracket pair [] also is treated as an operator. Both operators () and [] have the high 
est precedence and have left to right associativity. 
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i nt 


1, b = 2 


'"***1 At this point, we can think of the variables a, b, and p stored in memory as 
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6.2 Pointers 

A simple variable in a program is stored in a certain number of bytes at a particular 
memory location, or address, in the machine. Pointers are used in programs to access! 
memory and manipulate addresses. JJ 

If v is a variable, then &v is the location, or address, in memory of its stored valui 
The address operator & is unary and has the same precedence and right to left associS 
tivity as the other unary operators. Addresses are a set of values that can be manipu¬ 
lated. Pointer variables can be declared in programs and then used to take addresses as 
values. The declaration j 

int *p; jf 

declares p to be of type pointer to i nt. Its legal range of values always includes the spe§| 
cial address 0 and a set of positive integers that are interpreted as machine address®!®# 
on the given C system. Some examples of assignment to the pointer p are fj|| 


IpL ■l hi !lt.‘ hi : P !i‘ nter P aS “ arrow ' bUt because * has "O' y« been assigned a value 
In** not ^ ow W ^ at P°mts to. Suppose that our next line of code is 


= &a; 


n's.U's-':: 


Smsm 


r read ,hlS “ " P iS aSSig,led the addreSS 0f a ■" and "* fve <*» fbl'ewins Picture: 




wmk.. ; 















m:: , 

Mr. 


flfitfow let us make the assignment 


m 


w* • 


p = 0; 

p = NULL; 
p = &i; 

p = (int *) 1776; 


/* equivalent to p = 0; */ 

/* an absolute address in memory */ 




p - tkmm 
nW& i ’ 





mm & 

rrv. 
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In the third example, we think of p as “referring to i” or “pointing to i” or “containing 
the address of i.” In the fourth example, the cast is necessary to avoid a compiler err||J 
The indirection or dereferencing operator * is unary and has the same precedence 
and right to left associativity as the other unary operators. If p is a pointer, then *p is 
the value of the variable of which p is the address. The name “indirection” is taken froj^jjj 
machine language programming. The direct value of p is a memory location, whereas f§g 
is the indirect value of p—namely, the value at the memory location stored in p. In a eer->| 
tain sense * is the inverse operator to &. We want to give an explicit, yet elementajjjjjl 
example of how the pointer mechanism works. Let us start with the declaration JS| 
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j§ b = *p; is equivalent to b = a; 

tJflTgf 3 Sim J le f ° gram th3t illustrates the distinction between a pointer value 
which on mostTvstemf TraT ^ T ^ %P f ° rmat t0 print the value of a Pointer, 

pss «r 6 ““ number - on ansi c ~ the %p 
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In file locate.c 

/* Printing an address, or location. */ 

#include <stdio.h> 

int main(void) 

{ 

int i = 7, *p = &i; 

printf("%s%d\n%s%p\n", " Value of i : ", *p 
"Location of i: ", p); 
return 0; 

} 



ML 7 * * p / * q + 7 
If instead we had written 

7 * * p /* q + 7 


/* trouble? */ 


: If* 
11 


Mmk 


The output of this program on our system is 

Value of i: 7 
Location of i: effffb24 

A pointer can be initialized in a declaration. The variable p is of type int * and its ini-| 
tial value is &i. Also, the declaration of i must occur before we take its address. The , 
actual location of a variable in memory is system-dependent. The operator * derefeb|| 
ences p. That is, p contains an address, or location, and the expression *p has the value! 
of what is stored at this location appropriately interpreted according to the type decla-j 
ration of p. 

The following table illustrates how some pointer expressions are evaluated: 


gpl|we would find that the compiler treats /* as the start of a comment. This can result in a 
Mggg g difficult bug. 

“§1 In traditional C, conversions during assignment between different pointer types usu- 
gggsl §|ally are allowed. In ANSI C, such conversions are not allowed unless one of the types is 
jBfef l pointer to voi d, or the right side is the constant 0. Thus, we can think of voi d * as a 
||fc||jgeneric P°* nter type- t his is an important point. (See Section 6.8, “Dynamic Memory 
milocation With call oc() and malice (),” on page 262”.) 


wm 

r: 
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1 Declarations and initializations 

int i = 3, j = 5, *p = &i , *q = & j , *r; 

double x; 

Expression 

Equivalent expression 

Value 

p == & i 

«!— 

II 

1! 

CL 

1 

* * & p 

* C* C& P)) 

3 

r = & x 

“5 

II 

A 

Qo 

X 

/* illegal */ 

7 * * p / * q + 7 

(((7 * (* p))) / (* q)) + 7 

11 

* (r = & j) *- * p 

(* (r = (& j))) *= (* p) 

15 


wt 

. lliiiKfeatea 


Declarations 

int *p; 

m .- 

f1 oat *q; 


void *v; 


Legal assignments 

Illegal assignments 

P = 0; 

P = l; 

p = (int *) 1; 

v = 1; 

p = v = q; 

P = q; 

p = (int *) q; 



- r ii 

11 


Of course, not every value is stored in an accessible memory location. It is useful to 
in mind the following prohibitions: 




mm, . 


In this table, we attempted to assign r the value &x. Because r is a pointer to i nt and 
the expression &x is of type pointer to doubl e, this is illegal. Also, note that in the table 
we used the expression 


safer 


Constructs not to be pointed at 

■ Do not point at constants. 

&3 /* illegal */ 

■ Do not point at ordinary expressions. 

&(k + 99) /* illegal */ 

m Do not point at register variables, 
register v; 

&v /* illegal */ 


ifflie address operator can be applied to variables and array elements. If a is an array 
!»en expressions such as &a [0] and &a [i + j+3] make sense. 


ms 
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6.3 Call-by-Reference m 

■ i-i „„ .ironmerits to a function, their values are copied to v.? 

Whenever variables are passed as argumen themselves are n0 , changed,; 

the corresponding function parameters, , . is str ictly adhered to in C. 5 

in the calling environment!Lhis ca r y-Ve environmentj other languages provide 

To change the values of variables^ l j! e J h how the use 0 f addresses of 

the “call-by-reference” mechanism. In this section we vaii-hv-reference.” II 

variables as arguments to functions can produce the effect of ) • 

vanames as argumcma . . must be used in the parameter 

For a function to effect "call-by-reference pointers — used^^ 

list in the function definition Then «hen the iuncn ^ function that swaps I 

must be passed as arguments. As an example 01 

the values of two variables in the calling environmen . ^ 

•Jf§g 

void swap(int *p, int *q) j§ 

{ '"iOs 

int tmp; 

tmp = ,f p; 

*p = *q; 

*q = tmp; « 

} 

To test our function, we can write *mm 

#include <stdio.h> mt 

•$?$$§£ 

void swap(int *, int *); m 

int main (void) ®| 

{ . ii 

int i = 3, j = 5; 

|p 

p“mfc'%d j) i.d\n", 1. j); /* 5 3 is » rinted % 

return 0; 

} 

Note that in the calling environment we have paissed ad dre “ es 
function, and that in the function definition we have used pointers. This is consi g 

pattern. 

1 



sWKSiM, 

j 

fe ' 

V 

Dissection of the swapQ Function 

|pg| ; 

sTB void swap(int *p, int *q) 

*' { 

int tmp; 

* % This function takes two arguments of type pointer to i nt and returns not hing, The vari¬ 
able tmp is local to this function and is of type i nt. As the name indicates, we think of 
* this as temporary storage. 

tmp = *p; 

*P = *q; 

*q = tmp; 

||The variable tmp is assigned the value pointed to by p. The object pointed to by p is 
||assigned the value pointed to by q. The object pointed to by q is assigned the value tmp. 

This has the effect of interchanging in the calling environment the stored values of 
jHwhatever p and q are pointing to. 

, The effect of "call-by-reference’’ is accomplished by 

1 Declaring a function parameter to be a pointer 

2 Using the dereferenced pointer in the function body 

3 Passing an address as an argument when the function is called 

6.4 The Relationship Between Arrays and Pointers 

»ff : An array name by itself is an address, or pointer value, and pointers, as well as arrays, 
can be subscripted. Although pointers and arrays are almost synonymous in terms of 
how they are used to access memory, there are differences, and these differences are 
j subtle and important. A pointer variable can take different addresses as values. In con- 
H trast, an array name is an address, or pointer, that is fixed. 

Suppose that a is an array and that i is an i nt. It is a fundamental fact that the 
expression 










mi 
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a[i ] 


i s equivalent to 


*(a + i) 


p[i ] 


is equivalent to 


Cp + i) 


This means that we can (and do) use array notation with pointers. Expressions such as 
a + i and p + i are examples of pointer arithmetic. The expression a + i has as its 
value the fth offset from the base address of the array a. That is, it points to the z'th ele¬ 
ment of the array (counting from 0). In a similar manner, p + i is the zth offset from 

the value of p. The actual address produced by such an offset depends on the type that 
p points to. 

When an array is declared, the compiler must allocate a sufficient amount of contigu¬ 
ous space in memory to contain all the elements of the array. The base address of the 
array is the initial location in memory where the array is stored; it is the address of the 
first element (index 0) of the array. Consider the following declarations: 


#define 


100 


i nt 


a[N] 


p, sum = 0 


Suppose that the system assigns 300 as the base address of the array and that memory 
bytes numbered 300, 304, 308, ..., 696 are allocated as the addresses of a[0], a[l], 
a [2]. - , a [99], respectively. We are assuming that each byte is addressable and that 4 
bytes are used to store an int. This is system-dependent. The statement 


p = a; 


i s equivalent to 


p = &a[0]; 


It causes 300 to be assigned to p. Pointer arithmetic provides an alternative to array 
indexing. The statement 


a + 1 


is equivalent to 


p = &a[l]; 


It causes 304 to be assigned to p. Assuming that the elements of a have been assigned 
values, we can use the following code to sum the array: 

for (p = a; p < &a[N]; ++p) 
sum += *p; 

First p is initialized to the base address of the array. In the body of the loop, the vari¬ 
able sum is incremented by the value pointed to by p. Each time through the loop, p is 


4 

y 


The expression a[i] has the value of the ith element of the array (counting from 0), 

whereas *(a + i) is the dereferencing of the expression a + i, a pointer expression 

that points i element positions past a. If p is a pointer, then in a similar fashion the 
expression 
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incremented, so its successive values are &a[0], &a[l], ... , &a[N-l]. Here is another 
way of summing the array: 

for (i =0; i < N; ++i) 
sum += *(a + i); 

In the body of the loop the pointer value a is offset by i and then dereferenced. This 
produces the value a[i ]. Finally, here is a third way of summing the array: 


for (i =0; i < N; ++i) 
sum += p [i ]; 

Note that because a is a constant pointer, expressions such as 


++a 


a += 2 


are illegal. We cannot change the value of a. 


6.5 


Pointer Arithmetic and Element Size 


Pointer arithmetic is one of the powerful features of C. If the variable p is a pointer to a 
particular type, then the expression p + 1 yields the correct machine address for stor¬ 
ing or accessing the next variable of that type. In a similar fashion, pointer expressions 
such as p + i and ++p and p += i all make sense. If p and q are both pointing to ele¬ 
ments of an array, then p - q yields the i nt value representing the number of array 
elements between p and q. Even though pointer expressions and arithmetic expressions 
have a similar appearance, there is a critical difference in interpretation between the 
two types of expressions. The following code illustrates this: 


double a[2], *p, *q; 

p = a; 
q = p + X | 

printf("%d\n", q - p); 
printf("%d\n", (int) q 


/* points to base of array */ 
/* equivalent to q = &a[l] */ 

/* 1 is printed */ 
(int) p); /* 8 is printed */ 


On most machines, a double is stored in 8 bytes. Because p points to a double and q 
points to the next double, the difference in terms of array elements is 1, but the differ¬ 
ence in memory locations is 8. 
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6.7 T An Example: Bubble Sort 


.5 


6.6 Arrays as Function Arguments 


In a function definition, a formal parameter that is declared as an array is actually a 
pointer. Corresponding to this, when an array is passed as an argument to a function, 
the base address of the array is passed “call-by-value.” The array elements themselves 
are not copied. As a notational convenience, the compiler allows array bracket notation 
to be used in declaring pointers as parameters. To illustrate these ideas, we write a 1 
function that sums the elements of an array of type double. 



Various ways that sum() might be called 


Invocation 


What gets computed and returned 


wmmx^r. 
I J 

fl 

IfAII 
’-WW II 

iafpffli 


i 


/* n is the size a[] */ 


double sum(double a[], int n) 

{ 

i nt i; 

double sum = 0.0; 

for (i =0; i < n; ++i) 
sum += a[i ]; 
return sum; 


Note carefully that in the header to the function definition the declaration of thefl 
parameter a as an array is equivalent to its declaration as a pointer. This means that anfj 
equivalent function definition is given by ilk 



double sum(double *a, int n) 

{ 


/* n is the size a[] */ 


Although an array declaration is equivalent to a pointer declaration in a parameter list, 
this equivalence does not hold for external declarations or for declarations within the 
body of a function. | 

Suppose that in mai n() we have an array, or vector, v declared with 100 elements. 
After the elements of the vector have been assigned values, we can use the above func- 
tion sum() to add various of the elements of v. The following table illustrates some of 
the possibilities: 




sum(v, 100) 


sum(v, 88) 


sum(&v[7], k - 7) 


sum(v +7, 2 * k) 


v[0] + v[l] + . 


V[0] + V[l] + . 


v[7] + v[8] + . 


v[7] + v[8] + . 


.c... + v[99] 


* L m m ft * V O / J 


.c... + v[k - 1] 


.c... + v[2 * k + 6] 


The last call illustrates again the use of pointer arithmetic. The base address of v is off¬ 
set by 7, causing the local pointer variable a in sum() to be initialized to this value. This 
causes all address calculations inside the function call to be similarly offset. 


6.7 An Example: Bubble Sort 

Although we presented a bubble sort in Section 1.8, “Arrays, Strings, and Pointers,” on 
page 37, we will do so again here and illustrate in some detail how the function works 
on a particular array of integers. We will use the function swap () from the previous 
section. 


void swap(int *, int *); 

void bubble(int a[], int n) 

{ 

int i, j; 


/* n is the size of a[] */ 


for (i = 0; i < n - 1; ++i) 
for (j = n - 1; j > i; --j) 
if (a[j-1] > a[j]) 

swap(&a[j-l], &a[j]); 


Because of pointer arithmetic, the expressions a + i and &a[i] are equivalent. Thus, 
the function call to swapO could have been written 
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Now we describe how the bubble sort works. Suppose that in mai n() we have 

int a[] = {7, 3, 66, 3, -5, 22, 77, 2}; 

bubb1e(a, 8); 

The following table shows the elements of a[] after each pass of the outer loop in the 
function bubble(): 



Unordered data: 

/ 

3 

First pass; 


7 

Second pass: 

-77 

-5 

Third pass: 

-77 

-5 

Fourth pass: 

-77 

-5 

Fifth pass: 

-77 

-5 

Sixth pass: 


-5 

Seventh pass: 

-77 

-5 


m 
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6.8 


Dynamic Memory Allocation With calloc() 
and mallocO 


BIBfe provides the functions calloc() and mallocO in the standard library, and their 
H function prototypes are in stdiib.h. The name cal 1 oc stands for “contiguous alloca¬ 
tion,” and the name malloc stands for “memory allocation.” 

- ' 'y‘ 

fife The programmer uses calloc() and mallocO to dynamically create space for 
H fc jfrrAys, structures, and unions. (See Chapter 9, “Structures and Unions,” and Chapter 10, 
Substructures and List Processing," for structures and unions.) Here is a typical example 
HI that shows how callocQ can be used to allocate space dynamically for an array: 


'ff-Z-i- Y". *. . :.. 

- 

illy • 

I 


#include <stdio.h> ■ 
finclude <stdlib.h> 

int main(void) 

{ 


At the start of the first pass, a[6] is compared with a [7]. Because the values are IB 
order, they are not exchanged. Then a[5] is compared with a[6], and because thes| 
values are out of order, they are exchanged. Then a[4] is compared with a[5], and s| 
on. Adjacent out-of-order values are exchanged. The eifect of the first pass is to bub¬ 
ble” the smallest value in the array into the element a[0]. In the second pass a[0] is 
left unchanged and a [6] is compared first with a [7], and so on. After the second pass, 
the next to the smallest value is in a[l]. Because each pass bubbles the next smallest 
element to its appropriate array position, the algorithm will, after n - 1 passes, have all 
the elements ordered. Notice that in this example all the elements have been ordered 
after the fifth pass. Tt is possible to modify the algorithm so that it terminates earlier by 
adding a variable that detects if no exchanges are made in a given pass. We leave this as 
an exercise. 

A bubble sort is very inefficient. If the size of the array is n, then the number of com¬ 
parisons performed is proportional to n 2 . The merge sort discussed in Section 6.9, Ai| 
Example: Merge and Merge Sort,” on page 266, is much more efficient; it is an n log ik 

algorithm. 


V £'« . ' • 


i nt 
int 


a f 

n: 


/* to be used as an array */ 

/* the size of the array */ 

/* get n from somewhere, perhaps 
interactively from the user */ 
/* get space for a */ 

/* use a as an array */ 


HHp- a = calloc(n, sizeof(int)); /* get space for a */ 

jjljjji . /* use a as an array */ 

fjf The function callocQ takes two arguments, both of type size_t. In ANSI C, size_t 
jfj must be an unsigned integral type. Typically, a typedef is used in stdiib.h to make 

iSStisize_t equivalent to the type unsigned int. A function call of the form 

|Br 

§| 

Sm cal 1 oc(n, eL size) 




^ allocates contiguous space in memory for an array of n elements, with each element 
111 having eL size bytes. The space is initialized with all bits set to zero. If the call is sue- 
|g§ cessful, a pointer of type void * that points to the base of the array in memory is 
|J‘ returned; otherwise, NULL is returned. Note that the use of the sizeof operator makes 
§£ the code appropriate for machines having either 2- or 4-byte words. 


i 













t of 
ts t( 


liar fas 
ssful, i 
; other; 


Unlike callocO, the function mallocO does not initialize the space in memory that it 
makes available. If there is no reason to initialize the array to zero, then the use of" 
either callocO or mallocO is acceptable. In a large program, mallocO may take lessi 
time. ;f 

Space that has been dynamically allocated with either callocO or mallocO doesf 
not get returned to the system upon function exit. The programmer must use f ree() 
explicitly to return the space. A call of the form 

free(ptr) fj 

causes the space in memory pointed to by ptr to be deallocated. If ptr is NULL, th|| 
function has no effect. If ptr is not NULL, it must be the base address of space previ¬ 
ously allocated by a call to callocO, mallocO, or real loc() that has not yet been 
deallocated by a call to f ree() or realloc(). Otherwise, the call is in error. The effect 
of the error is system-dependent. 

Let us write a small program that illustrates the ideas we have presented in this sec¬ 
tion. The first printf O statement in the program tells the user exactly what the pro¬ 
gram does. 

| 

#include <stdio.h> 

#include <stdlib.h> JS 

#include <time.h> 

|>VI 

void fill_array(int *a, int n); 

int sum_array(int *a, int n); f| 

void wrt_array(int *a, int n); 


m 
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int main(void) 

{ 

int *a, n; 

seed the rand °m number generator */ 

printf( \n%s\n , 

"This program does the following repeatedly\n" 

" k n 

" 1 create space for an array of size n\n" 

" 2 fill the array with randomly distributed diaitsXn" 

;; 3 print the array and the sum of its elementXn" 

4 release the space\n"); 
for ( ; ; ) { 

printf("Input n: "); 
if (scanf("%d", &n) != 1 || n < 1) 
break; 

putchar('\n'); 

a = caHoc(n, sizeof(int)); /* allocate space for a[] */ 

fill_array(a, n); 

wrt_array(a, n); 

printf("sum = %d\n\n", sum_array(a, n)); 
free(a); 

} 

printf("\nBye!\n\n"); 
return 0; 


void fi1l_array(int *a, int n) 

{ 

int i; 


for (i = 
a [i ] 


0; i ^ n; -(“+1) 

= rand() % 19 - 9 


int sum_array(int *a, int nl 

{ 

int i, sum = 0; 

for (i =0; i < n; ++i) 
sum += a[i]; 
return sum; 
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void wrt_array(int *a, int n) 

{ 

int i; 



printf("a = ["); 

for (i = 0; i < n; ++i) 

printf("%d%s", a[i], C(i < n - 1) ? ", 


"]\n")); 



Offsetting the Pointer 

For arrays (vectors) intended for mathematical use, we often want to index the arrays 
from 1 instead 0. In a small program we can do something like the following: ■, 

int n; J 

double *a; 

a = calloc(n + 1, sizeof(double)); 

Because the size of a is one more than n, we can disregard a[0] and use a[l],..., a[n] 
as needed. This is certainly an acceptable scheme. However, an alternative scheme is tj| 
do the following. First we write 

a = calloc(n, sizeof(double)); 

At this point, here is what we have in memory: 


a 

„ 

012 n-1 


Next, we write 



/* offset the pointer */ 
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• -.A” " < 


• 1. mm 




Ifhe picture of what is in memory changes to the following: 


Sttfl 





ggplg 









0 1 2 


Notice that the shaded area indicates memory the programmer does not own. Thus, 
a[0] should not be accessed: neither written to nor read. The elements a[l],..., a[n], 
however, are now accessible for use by the programmer. To deallocate the space, we 
can write free (a + 1). 

In Section 12.6, “Dynamic Allocation of Matrices,” on page 571, we will expand on the 
ideas presented in this section when we deal with mathematical matrices. 


6.9 An Example: Merge and Merge Sort 

Suppose we have two ordered arrays of integers, say a [] and b []. If we want to merge 
them into another ordered array, say c[], then the algorithm to do so is simple. First, 
compare a[0] and b [0]. Whichever is smaller, say b [0], put into c[0]. Next, compare 
a[0] and b [1]. Whichever is smaller, say b[1], put into c[1]. Next, compare a[0] and 
b [2]. Whichever is smaller, say a[0], put into c[2]. Next, compare a[l] and b [2], and 
so on. Eventually one of the arrays a[] or b[] will be exhausted. At that point, the 
remainder of the elements in the other array are simply copied into c[]. Here is a func¬ 
tion that does this, along with a header file that will get used in a program that tests 
our functions: 

In file mergesort.h 




finclude <assert.h> 
#include <stdio.h> 
#include <stdlib.h> 


void 
void 
voi d 


merge (int a[], int b[], int c[], int m, int n); 

mo rnacn rl" fi ni~ kpvfl --a- 
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In file merge, c 

/" Merge a[] of size m and b[] of size n into c[], */ 
#include "mergesort.h" 

void merge(int a[], int b[], int c[], int rn, int n) 
int i = 0 , j = 0 , k = 0 ; 

while (i < m && j < n) 
if (a[i] < b[j]) 
c[k++] = a[i++]; 
else 

cfk+-{-l 3s bfi+4.1* 

Whl c[k+i] < = m a[i ++ i- ' /Vf P1 ’ ck up any remai "nder */ 
while (j < n) 

c[k++] = b[j++]; 


The array c[] is assumed to contain enough space to hold both a[] and b[]. The pro- 

tlWt thG b ° Unds ° n C[] are not ovemm - Observe that one, 
or possibly both, of the last two whi 1 e statements that are used to pick up any remain- 

er o not get executed. This is because at least one of the two conditions i < m and 
j < nwill not be true. 

Next we will write a mergesortC) function that calls merge(). In contrast to a bub- 

kivri whi^T e SOrt 18 r yeffiCient - ° UT function mergesortC) will act on an array 
m V ,’u hjch l has a Slze that 1S a power of 2. The “power of 2” requirement will help to 
make the explanation simpler. In exercise 16, on page 315, we indicate how this restric- 

can e removed. To understand how merge sort works, let us suppose that key[] 

contains the following 16 integers: yu 
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Unordered 

A 

X 

- 3 — 

1 

67 

55 

T" 

0 

4 

”5 

37 


4 

T~ 

9 

i 

-1 

data: 

















First pass: 

o 

4 

1 

67 

8 

55 

0 

4 

-5 

37 

4 

7 

2 

Q 

y/ 

-i 

i 

Second pass: 

1 

3 

A 

67 

0 

4 

8 

55 

-5 

4 

7 

37 

-i 

1 

2 

9 

Third pass: 

0 

i 

3 

4 

A 

8 

55 

67 

-5 

-i 

1 

2 

4 

7 

9 

37 

Fourth pass: 

-5 

-1 

0 

1 

1 

2 

3 


4 

4 

7 

8 

9 

37 

55 

67 


After the first pass, w 7 e want each successive pair of integers to be in order. After the 
second pass, we want each successive quartet of integers to be in order. After the third 
pass, we want each successive octet of integers to be in order. Finally, after the fourth 
pass, we want all 16 of the integers to be in order. At each stage mergeO is used to 
accomplish the desired ordering. For example, after the third pass we have the two sub¬ 
arrays 


0 1 3 4 4 8 55 67 


and 


1 1 2 4 7 9 37 


mmm. 


which are both in order. By merging these two subarrays, we obtain the completely 
ordered array given in the last line of the table above. Surprisingly, the code that 
accomplishes this is quite short and it illustrates the power of pointer arithmetic. 
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In file mergesort.c 

/* Mergesort: Use merge() to sort an array of size n. */ 

#include "mergesort.h" 

void mergesort(int key[], int n) 

{ 


int g, k, m, *w; 
for (m = 1; m < n; m *= 


2 ) 


/* m is a power of 2 */ 


bye!\n") 


if (n < m) { 

printf("ERROR: Array size not a power of 2 
exit(l); 

} 

w = callocCn, sizeof(int)); /* allocate workspace */ 

assert(w != NULL); /* check that callocC) worked */ 

for (k = 1; k < n; k *= 2) { 

for (j = 0; j < n - k; j += 2 * k) 

/it 

// Merge two subarrays of key[] into a subarray of w[]. 

V 

merge(key + j, key + j + k 
for (j =0; j < n; ++j) 


} 


key[j] = w[j]; 


} 


free(w); 


w + j, k, k); 

/* write w back into key */ 
/* free the workspace */ 













Dissection of the mergesort () Function 

■ for (m = 1; m < n; m *= 2) 


After this loop, the value of m is the smallest power of 2 that is greater than or equal to 
n. If n is a power of 2, then n will be equal to m. 


if (n < m) { 

pri ntf("ERROR: Array size not a power of 2 
exit(l); 

} 


bye!\n") 


mm 










m 






w = cdalloc(n, sizeof(int)) 


/* allocate workspace */ 


mm 




mB 


.. ■ . 1 . 


i 

” v fa : v . „ 


tlltf . 


life 


We test to see if n is a power of 2. If it is not a power of 2, we exit the program with an 
appropriate message. 






life 

fa . 


■ 

mm. 

Pip/, 




The function calloc() is in the standard library, and its function prototype is in the 
header file stdlib.h. The function is used to allocate space in memory for an array 
dynamically. The first argument to the function specifies the number of elements in the 
array, and the second argument specifies the size in bytes needed to store each ele¬ 
ment. The function returns a pointer of type void * to the allocated space. Because its 
type is void *, the pointer can be assigned to w without a cast. In traditional C, we 
would have written 

w = (int *) calloc(n, sizeof (int)); 

Although w is a pointer to i nt, we can now use it just as we would an array. Even 
though mergesortQ is very efficient, the fact that additional workspace is needed to 
sort an array is a disadvantage. In Chapter 8 we will discuss the use of quicksortQ, 
which does not require additional workspace. 


assert(w != NULL); 


/* check that callocQ worked */ 


If callocQ fails to allocate space—for example, the system free store (heap) is 
exhausted—then NULL is returned. We used the assert() macro in assert.h to make 
sure this did not happen. If the expression w ! = NULL is false, then the program will be 
aborted at this point. 

■ for (k = 1; k < n; k *= 2) { 

for (j =0; j < n - k; j += 2 * k) 

/* 

// Merge two subarrays of key[] into a subarray of wf]. 

*/ 

merge(key + j, key + j + k, w + j, k, k); 
for (j =0; j < n; ++j) 

key[j] = w[j]; /* write w back into key */ 

} 

This is the heart of the algorithm. Suppose we start with key [] having the data given as 
unorderecl in the table above. In the first pass of the outer loop, k is 1. Consider the 
first inner loop 

for (j = 0; j < n - k; j += 2 * k) 

/i< 

// Merge two subarrays of key[] into a subarray of w[]. 

* j 

merge(key + j, key + j + k, w + j, k, k); 
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The first call to merge() is equivalent to 

merge(key + 0, key + 0 + 1, w + 0, 1, 1); || M " , 

-.s' 

The arrays based at key and key + 1, both of size 1, are being merged and put into a lj| 
subarray of size 2 of w that is based at w + 0. This will result in w[0] and w[l] being in 'JH 
order. The next call to mergeQ is equivalent to 




merge(key + 2, key + 2 + 1, w + 2, 1, 1); 

The arrays based at key + 2 and key + 3, both of size 1, are being merged and put 
into a subarray of size 2 of w that is based at w + 2. This will result in w[2] and w[3] 
being in order. The next call... , and so on. After the first pass of the outer loop, each 
successive pair of elements in w[] is in order; see the preceding table. At this point, the 
array w[] is copied into key []. 

In the second pass of the outer for loop, k is 2, and the next call to mergeO is equiv¬ 
alent to the following: 

merge(key + 0, key + 0 + 2, w + 0, 2, 2); 

The arrays based at key and key + 2, both of size 2, are being merged and put into a 
subarray of size 4 of w that is based at w + 0. This will result in w[0], w[l], w[2], w[3] 
being in order. The next call to mergeO is equivalent to 

merge(key + 4, key + 4 + 2, w + 4, 2, 2) ; 

The arrays based at key + 4 and key + 6, both of size 2, are being merged and put 
into a subarray of size 4 of w based at w + 4. This will result in w[4], w[5], w[6], w[7] 
being in order. The next call..., and so on. After the second pass of the outer loop, each 
successive quartet of elements in w[] is in order; see the preceding table. At this point 
the array w[] is copied into key[]. In the third pass of the outer loop, k is 4, and the 
next call to mergeO ..., and so on. 


I |; 








free(w); 


/* free the workspace */ 


This function is in the standard library, and its function prototype is in stdlib.h. It 
causes the space pointed to by w to be deallocated. The system is then able to use this 
space for some other purpose. Unlike automatic storage, space allocated by callocO 
or mal 1 oc() is not relinquished automatically upon exit from a function. The program¬ 
mer must explicitly free this space. 
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We want to write a program to test the functions mergeO and mergesortO. The inter¬ 
ested reader should modify the program to print out the array key [] after each pass of 
the outer loop in mergesortO to check that the preceding table is reproduced. Here 
are the remaining two functions that comprise our program: 

In file main.c 

j§ /* Test mergeO and mergesortO. V 
#include "mergesort.h" 


int main(void) 

{ 

int sz, key[] 


{ 4, 3, 1, 67, 55, 8, 0, 4, 
-5, 37, 7, 4, 2, 9, 1, -1 }; 


sz = sizeof(key) / sizeof(int); 

printf("Before mergesort:\n"); 

wrt(key, sz); 

mergesort(key, sz); 

printf("After mergesort:\n"); 

wrt(key, sz); 

return 0; 


/* the size of key[] */ 


In file wrt.c 


#include "mergesort.h" 

void wrt(int key[], int sz) 

{ 

int i; 

for (i =0; i < sz; ++i) 

printf("%4d%s", key[i], ((i < sz 


- 1) ? "" : "\n")); 


The amount of work that a merge sort does when sorting n elements is proportional 
to n log n. Compared to a bubble sort, this is a very significant improvement. Sorting is 
often critical to the efficient handling of large amounts of stored information. However, 
it is beyond the scope of this text to discuss the topic in detail; see The Art of Computer 
Programming, vol. 3, Sorting and Searching, by Donald Ervin Knuth (Reading, MA: Add- 
ison-Wesley, 1973). 
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6.10 Strings 

nSed H im f enS1 ° naI ariayS ° f type char ‘ By invention, a string in C is term! 

. ,1 ^ end-of-string sentinel \0, or null character. The null character is a hvr 1 

With all bits off; hence, its decimal value is zero. Because desteXl ' k " ? 

z: £ v gth 7eSedtT0 ic h se rr y '* is usew to twnk - »*** zz 

strincr Thf ’ delin “ ted b > \ 0 - but wth a maximum length determined by the size of the 
n P ] f; [ S1 f of a strm § must inclu de the storage needed for the end-of-string semi- 

are not overm!^ 9 ^’ lt “ ^ j ° b ° f the programmer t0 make sure tbat string bounds 

tttSworJi^ 8 wnhTL 11 ? bet r Cn d ° Uble qUOtes ’ For exam P |e - " abc " is a charac- 
iay of size 4, with the last element being the null character \0. Note that strincr 

fr ° m Ch f K,CT COnS,mtS ' *» «“«*. ? Z S 

value' ' \r haS elements - lhe first ™th value ’ a ’ and the second with 

Itstaln?,? fh n ! tant ’ 'if an afray name by itself ’ is treated by the compiler as a pointer 
ts value is the base address of the string. Consider the following code: 1 


char 


'abc"; 


printf( %s\n , p, p + 1); /* abc be is printed */ 

potato 1o b cha P r i i S snrtS d Z b / se address 0f the cl,aracter array "abc". When a 
successive II d m ' i fonMt of a strln «' ,he P°inted-at character and each 

PHntTo state ZZZheZZesZlT ^ 18 reached - Thus ' “ tha 

d + 1 whirh !’ !l f P re ssion p causes abc to be printed, and the expression 

a string constant ,rh° ■ f f. m the String " abc "’ causes bc to be P™ted. Because 
b s tant such as abc is treated as a pointer, expressions such as 


"abc"[1] 


and 


("abc" + 2) 


are possible. (See exercise 22, on nase SI 71 Snrh pvnrpccinno , . 

code, but they help emphasize that ‘™“ 

*” Slmllar USeS - They alS0 have d,ffer - 


char 


*n - 


"abede" 


and 


char s[] = "abede" 
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In the first declaration, the compiler allocates space in memory for p, puts the string 
constant "abede" in memory somewhere else, and initializes p with the base address of 
the string constant. We now think of p as pointing to the string. The second declaration 
js equivalent to 

char s[] = {'a', 'b\ 'c 1 , 'd', ' e * , * \0'} ; 

Because the brackets are empty, the compiler allocates 6 bytes of memory for the array 
s. The first byte is initialized with ' a', the second byte is initialized with ' b', and so 
on. Here is how we think of these objects stored in memory: 



A char is always stored in 1 byte, and on most machines a pointer is stored in a word. 
Thus, on our machine, p is stored in 4 bytes, and s is stored in 6 bytes. Of course, p con¬ 
tains the address of a string that requires another 6 bytes of storage. (See exercise 24, 
on page 319, for further discussion.) 

For technical reasons, it is better not to print null characters. If null characters are 
put into a file and then another utility is used to process the file, the result can be con¬ 
fusing. (See exercise 25, on page 319.) However, the printing of null strings is perfectly 
acceptable. One natural instance of this occurs when dealing with plurals: 

char *s; 

int nfrogs; 

..... /* get nfrogs from somewhere */ 

s = (nfrogs == 1) ? "" : "s"; 

printf("We found %d frog%s in the pond.\n", nfrogs, s); 

To illustrate string processing, we will write a function that counts the number of 
words in a string. We assume tbat words in the string are separated by white space. Our 
function will use the macro i sspaceO, which is defined in the standard header file 
ctype.h. This macro is used to test whether a character is a blank, tab, newline, or some 
other white-space character. If the argument is a white-space character, then a nonzero 
(true) value is returned; otherwise, zero (false) is returned. 
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/* Count the number of words in a string. */ 

#include <ctype.h> 

int word_cnt(const char *s) 

{ 

int cnt = 0: 


ife V: H S 

r Ip 

• ;-Vi 


% \ S' 

&»{ 3 l 

'A $3? 

, y 


while (*s != ’\0’) { 
while (isspace(*s)) 

++S * 

if (*s'!= ’\0’) { 

++cnt; 

while (!isspace(*s) && *s 
++s; 

} 


\ 0 ’) 


w 

|p 

/* skip white space */ IMF 

/* found a word */^L 

MSI 

i /* skip the word */ J 8| 

■ r}0. 

\;i\-'ffMs&j&ypi, 

r'lfi 


return cnt; 




This is a typical string-processing function. Pointer aritlnnetic and dereferencing are^ 
used to search for various characters or patterns. Often a character pointer is used 5 jp|| 
march along a string while parsing it or interpreting it in some way. 

• 

*»• 


6.11 String-Handling Functions in the Standard Library 

ANSI C provides the programmer with numerous string-handling functions; see Appep 
dix A, “The Standard Library,” for details. In this section we want to illustrate the usej|j| 
a few selected string-handling functions and describe how they work. The function prj 
to types for string-handling functions are given in the standard header file string.h. 

We will see that some of the parameters In the function definitions of the string-han 
dling functions have the type const char *. The type qualifier const in this context ||l 
telling the compiler that the character pointed to in memory should not be changed. 
(See exercise 28, on page 321.) The pointer itself, however, can be changed. 



. i 
•; \gh- 
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Some String-Handling Functions in the Standard Library 

W char *strcat(char *sl, const char *s2); 

phis function takes two strings as argumenls, concatenates them, and puts the result in 
si. The programmer must ensure that si points to enough space to hold the result. The 
string si is returned. 

I int strcnip(const char *sl, const char *s2); 

Two strings are passed as arguments. An integer is returned that is less than, equal to, 
or greater than zero, depending on whether si is lexicographically less than, equal to, 
or greater than s2. 

■ char *strcpy(char *sl, const char *s2); 

The characters in the string s2 are copied into si until \0 is moved. Whatever exists in 
si is overwritten. It is assumed that si has enough space to hold the result. The pointer 
si is returned. 


■ size_t strlen(const char *s); 

A count of the number of characters before \0 is returned. ANSI C requires the type 
size_t to be an integral unsigned type. Typically, on systems with 4-byte i nts it is 
equivalent to unsigned int, and on systems with 2-byte i nts it is equivalent to 
unsi gned 1 ong. 


ill 



There is nothing special about these functions. They can be written in C, and are all 
quite short. Variables in them are often declared to have storage class register in an 
attempt to make them execute more quickly. Here is one way the function strl en() 
could be written: 

size_t strlen(const char *s) 

f { 

I: size_t n; 


for (n = 0; *s != 
++n; 

return n: 


'\0'; ++s) 
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«i 


The loop continues counting until the end-of-string character ' \0’ is detected, Fgj 
example, strlen("abc") returns the value 3, and strlenC"’) returns the value 0. Tgj| 
The function definition for strcpyC) is also simple, but it contains a C idiom thaft 
needs to be explained in detail. 




■fe®- i-.-y-ii 

- sJijl§ 


pHh+ 


^ C 7 *L X 
w> l* T 1 





char *strcpy(char *sl, register const char *s2) 

{ 

register char *p = si; 
while (”pn—h = "s2++) 

M 

} 

return si; 



g ^a^<5%Kt*5itnl*pif*r*4 1 X'’tsz?w?rw/?sx;i h'Wwrvtfw 


Igyppose p is pointing to a character in one string, and suppose s2 is pointing to a char¬ 
acter in a different string. (See exercise 18, on page 316, for another scenario.) The 
llue pointed to by s2 is assigned to the value pointed to by p, and this is the value of 
■the expression as a whole. Then both p and s2 get incremented, which causes them to 
point to the next character in each of the two strings. 



Dissection of the strcpyQ Function 


register char 


s X j 



g while (*p++ = *s2++) 

pr 

iff r 

Jpthe character pointed to by p gets assigned the value of the character pointed to by s2. 
®| jhen both p and s2 get incremented so that they each point to the next character. This 
H; process continues until s2 points to the null character \0, which has the value 0. At that 
■'point the value 0 is assigned to what p is pointing to, putting an end-of-string sentinel 
in place, and the value of the expression as a whole is 0, causing the flow of control to 
uplexit the whi 1 e loop. The effect of all of this is to copy the string s2 into the string si. 




*^§ys * 
' v>> ' * *' 


Observe that p is being initialized, not *p. The pointer p is initialized to the pointerggjp 
value si. Thus, p and si point to the same memory location. !|J 



The function strcat() is also quite simple. It can be written in a number of ways. 
ms Here is one of them: 


■ while (*p++ = *s2++) 

• . ''%:i 

As long as the expression *p++ = *s2++ is true (not zero), the body of the whi 1 e loop 
gets executed. The body, however, is empty, so its execution has no effect. All the 
action lakes place in the side effects of the expression controlling the whi 1 e loop. 

■ # rij_u 

U j i 
i 

Because the unary operator ++ is in postfix position, it has higher precedence than the * 
operator. Thus, *p++ is equivalent to *(p++), which means that p itself is being incre¬ 
mented. In contrast, the expression (*p)++ would increment what p is pointing to, leav¬ 
ing the value of p itself unchanged. (See exercise 17, on page 316.) The value of the 
expression p++ is the current value of p. This value is dereferenced; then the value of p 
in memory is incremented, causing it to point to the next character in the string. 


m 


bpPr^''' : 






char *strcat(char *sl, register const char *s2) 

{ 

register char *p = si; 



while (*p) 

++p; 

while (*p++ 

m 

f 

return si; 


s2++) 


/* go to the end */ 
/* copy */ 
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Declarations and initializations 


Dissection of the strcatO Function 

■ register char *p = si; .{ 

Observe that p is being initialized, not *p. The pointer p is initialized to the poinlfj 
value si. Thus, p and si point to the same memory location. 

■ while (*p) 

++p; 

As long as the value pointed to by p is nonzero, p is incremented, causing it to pmnt 
the next character in the string. When p points to the end-of-strmg sen me \ . 1 

expression *p has the value 0. This causes control to exit the while statement. Notice^; 

that we could have written 


while (*p != '\0') 


instead of 


while (*p) 


* - -- -> 

J 


char sl[] = 
s2[] = 


Expression 


strlen(sl) 


strlen(s2 + 8) 


strcmpCsl, s2) 


Statements 


printf("%s", si + 10); 


strcpyCsl + 10, s2 + 8); 


strcat(sl, "s!"); 


printf("%s", si); 


"beautiful big sky country" 
"how now brown cow"; 


negative integer 


What gets printed 


big sky country 


beautiful brown cows 


to achieve the same effect. Functions often are written in a sparse manner. A V ougK 
one might think that sparse C code produces faster running object code, most C coi| 
pilers produce the same object code for both forms of this whi 1 e statemen . 

■ while (*p++ = *s2++) 

S 

9 : ■ .. § 

At the beginning of this while statement, p points to the null character at the end of 
the string pointed to by si. Thus, the characters in s2 get copied one after another 
memory, overwriting the null character at the end of si and whatever follows it. Thg~ 
overall effect of this whi 1 e statement is to concatenate s2 to the end ot si. jj 




Pw- 

\.C;~ 


Tt.A' 


IH^ji ^ ! 

■ _ 
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String-handling functions are illustrated in the next table. Note carefully ^titis^j 
programmer’s responsibility to allocate sufficient space for the strings that ait passedj 

as arguments to these functions. ; 



|6.12 Multidimensional Arrays 

g*, i: 

IfThe C language allows arrays of any type, including arrays of arrays. With two bracket 
pairs, we obtain a two-dimensional array. This idea can be iterated to obtain arrays of 
« higher dimension. With each bracket pair, we add another array dimension. 


Examples of declarations of arrays 


int a[100]; 
int b[2][7]; 
int c[5][3] [2]; 


Remarks 


a one-dimensional array 
a two-dimensional array 
a three-dimensional array 


m§Mmi 

? .y fmm 



T A k-dimensional array has a size for each of its k dimensions. If we let s t represent the 
size of its ith dimension, then the declaration of the array will allocate space for 
Isi x s 2 x ... xij. elements. In the preceding table, b has 2x7 elements, and c has 
15 x 3 x 2 elements. Starting at the base address of the array, all the array elements are 
■ stored contiguously in memory. Caution: The multidimensional array mechanism 


'6 / 
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described here is often unsuitable for dealing with mathematical matrices. (See Section jj 
12.6, “Dynamic Allocation of Matrices,” on page 571, for further discussion.) 


Two-dimensional Arrays 

Even though array elements are stored contiguously one after the other, it is usually 
convenient to think of a two-dimensional array as a rectangular collection of elements f 
with rows and columns. For example, if we declare 

int a[3] [5] ; 

then we can think of the array elements arranged as follows: 



col 1 

col 2 

col 3 

col 4 

col 5 

row 1 

a[0][0] 

a[0][1] 

a[0][2] 

a[0] [3] 

a[0][4] 

row 2 

a[l][0] 

a[l][1] 

a [1] [2] 

a[l] [3] 

a[l][4] 

row 3 

a[2][0] 

a[2]Cl] 

_._ 


a[2][3] 

a[2] [4] 




Because of the relationship between arrays and pointers, there are numerous ways 
access elements of a two-dimensional array. 



*(a[i] + j) 

(*(a + i))[j] 

*((*(a + i)) + j) 
*(&a[0][0] + 5*i + j) 


The parentheses are necessary because the brackets [] have higher precedence thaifj 
the indirection operator *. We can think of a[i ] as the ith row’ of a (counting from 0), 
and we can think of a[i ] [j] as the element in the ith row, Jth column of the array! 
(counting from 0). The array name a by itself is equivalent to &a [0]; it is a pointer to an| 
array of 5 i nts. The base address of the array is &a[0] [0], not a. Starting at the base j 
address of the array, the compiler allocates contiguous space for 15 ints. 


StSt’ 




The Storage Mapping Function 




gU For any array, the mapping between pointer values and array indices is called the 
§1 storage mapping function. Consider the declaration 

int a[3] [5]; 

gJFor the array a, the storage mapping function is specified by noting that 
|j a[i] [j] is equivalent to *(&a[0][0] + 5*i + j) 

Formal Parameter Declarations 

,fi<When a multidimensional array is a formal parameter in a function definition, all 
gsizes except the first must be specified so that the compiler can determine the correct 
• ..storage mapping function. (Because of this, multidimensional arrays are often inappro¬ 
priate for use in mathematical programming; see Section 12.6, “Dynamic Allocation of 
gjtMatrices,” on page 571.) Consider the declaration 




int a[3] [5]; 


JpPh® allocates space for a 3 x 5 matrix. Its row size is 3, and its column size is 5 After 

|f|he elements of the matrix a have been assigned values, the following function can be 

J||t$ed to sum the elements of the matrix. Note carefully that the column size must be 
specified. 


$C: 

5 HSfi 

Hgs--: 


aw: 


int sum(int a[] [5]) 
{ - 

int i, j, 


sum 


for (i =0; i <3; ++i) 
for (j = 0; j < 5; ++j) 
sum += a[i][j]; 
return sum; 




the header of the function definition, the following three parameter declarations are 
equivalent: 




int a[] [5] 


int a[3] [5] 




int (*a)[5] 

pi the second declaration, the constant 3 acts as a reminder to human readers of the 
jgf^ ®> hut the compiler disregards it. In the third declaration, the parentheses are neces- 
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int sumCint a[][9][2]) 
lnt T> j, k, sum = 0 ; 



mmz" 

Split n 
B BjSfe 



i...: 


} 


for (i = 0 ; -j < 7 ; ++1 -^ 
for Cj = 0 ; j < 9 ; ++j) 
for Ck = 0 ; k < 2 ; ++IO 
sum += a[i]m[kl; 
return sum: ’ 



equivalent^ “* nm3 * a ***»* «* folto ™8 three parameter declarations ate 


int a[] [ 9 ] [ 2 ] 


int a[7][9][2] 


int (*a)[9][2] 


' L. “ l_ J 

to human o' «. 

Pi.er to generate the correct storage <* 

hons are equivalent only in a header to a function definition ‘ hrec 


Initialization 


isstv 





int a[2][3] = {1 2, 3, 4. 5 61• 

int a[2] [3] = {{ 1 , 2, 3}, {4,’ 5 6 }>- 

a[ ] [3] = {{ 1 , 2l 3}! { 4 ! 5, 6 }}’; 


§j§§|l8l!i 





-■ 


i y.«-- sfs.<to-v,' 


PSSI lfefc.«L' 

-£§jm P^T4^:! 

:#gfi| §-;••" - 
.' 



lm m T b CCS ’ then each of tho array dements am m a rcn rn 
a[l] [2] is initialized in turn. Note that the indexing ,« } ™ f L ® ] U C0] * a[0] W* - ■ 

tializers than elements in the array then the r™,' g 1 there are fewer ini " 

If the first bracket pair is "t ^ 

If we are willing to put in all the ZlcZ Z\ T * alWayS be given ^My. 
Consider the initialization - en e do not have to specify all the zeros. 


int 


ssli,..,, 

§§§g§ lilslilii: 

h 

Mfi /’ 

* 

‘ ; * tjpg< 


a[2][2][3] = { 

’ {2, 0, 0}} 

«3, 0, 0}, (4, 4, 0 }} 


1 

“ 
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An equivalent initialization is given by 

int a[][2][3] = {{{1, 1}, {2}>, {{3}, {4, 4}}}; 

If the initializers are fully and consistently braced, then wherever there are not enouf 
initializers listed, the remaining elements are initialized to zero. 

In general, if an array of storage class automatic is not explicitly initialized, the 

V.JSJ 

array elements start with “garbage” values. Here is a simple way to initialize all array 
elements to zero: 
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void add(vector x, vector y, vector z) 

... 

int i; 


y + 2 * / 
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for (i 
x[i] 
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0; i < N; ++i) 
■ Vti] + z[i]; 
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int a[2][2][3] = {0}; /* all element initialized to zero *M3 

A> vt- ^ 

Unlike automatic arrays, all static and external arrays are initialized to zero by defaultlM 
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The Use of typedef jj 

Let us illustrate the use of typedef by defining a small number of functions that oper¬ 
ate on vectors and matrices. 
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scalar dot_product(vector x, vector y) 

int i; 

scalar sum = 0.0; 

for (i =0; i < N; ++i) 
sum += x[i] * yfil • 
return sum; 
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" ftp c void multiply(matrix a, matrix b, matrix c) 

jjfi { 

^glt int i, j, k; 



#define N 3 

typedef double 
typedef scalar 
typedef scalar 


/* the size of all vectors and matrices */ 

scalar; 
vector[N]; 

matrixCN] [N]; 
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We have used the typedef mechanism to create the types sealar, vector, and matrix, 
which is both self-documenting and conceptually appropriate. Our programming lan¬ 
guage has been extended in a natural way to incorporate these new types as a domain.... 
Notice how typedef can be used to build hierarchies of types. For example, we could 

• -irf- 7 ? 

have written * 
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for (i = 0 ; i < N; ++ i) { 
for (j =0; j < N; ++j) { 
a[i][j]=0.0; 
for (k = 0; k < N; ++k) 








typedef vector matrixCN]; 


in place of 


typedef scalar matrixCN][N]; 

p5||| 

The use of typedef to create type names such as scalar, vector, and matrix allows J 
the programmer to think in terms of the application. Now we are ready to create func-1| 
tions that provide operations over our domain. .j- 



** routines are not generic because they work only with vectors and matrices of 

..f* ed We show how this limitation can be removed in Section 12.6, “Dynamic Allc 

(■(cation of Matrices,” on page 571. 
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6.13 Arrays of Pointers 

In C, array dements can be of any type, including a pointer type. Arrays of pointers 
have many uses. In this section we will write a program that uses an array of pointers to 
sort words in a file lexicographically. As we will see, an array with elements of type 
char * can be thought of as an array of strings. 

To explain how our program works, we need to imagine first how we are going to test 
it. Let us create the file input with the following two lines in it: 

A is for apple or alphabet pie 

which all get a slice of, come taste it and try. 

To sort the words in this file, we will give the command 
sort_words < input 

This is supposed to cause the following to be printed on the screen: 

A 

a 

all 

alphabet 

i f i ( i 

which 

Now we are ready to write our program. It will consist of header file and a number of .e 
files. Here is the header file and the function mai n (): 

In file sort.h 

#include <stdio.h> 

#include <stdlib.h> 

#include <string.h> 


#define MAXWORD 50 

#define N 300 


/* max word size */ 

/* array size of w[] */ 
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| void error_exit_calloc_failed(void); 

I void error_exit_too_many_words(void); 

jp" void error_exit_word_too_long(void); 

| void sort._words (char *w[], int n) ; 

| void swap(char **p, char **q); 

I void wrt_words(char *w[], int n); 

In file main.c 

I I /* Sort words lexicographically, */ 
#include "sort.h" 


int main(void) 

{ 

char word[MAXWORD]; 

char *w[N]; 
int n; 

i nt i: 


/* work space */ 

/* an array of pointers */ 

/* number of words to be sorted */ 


for (i =0; scanf("%s", word) == 1; ++i) { 
if (i >= M) 

error_exit_too_many_words(); 
if (strlen(word) >= MAXWORD) 
error_exit_word_too_long(); 
w[i] = calloc(strlen(word) + 1, sizeof(char)); 
if (w[i] == NULL) 

error_exit_calloc_failed(); 
strcpy(w[i], word); 


} 

n = i; 

sort_words(w, n); 
wrt_words(w, n); 
return 0; 


/* sort the words */ 

/* write sorted list of words */ 
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Dissection of main() in the sorLwords Program 
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#define 
#define 


MAXWORD 


50 

300 


/* max word size */ 

/* array size of w[] */ 


These two lines in the header file define the symbolic constants MAXWORD and N, which 
get used in mai n() and in other functions. We are assuming that no word in the input 
file has more than 50 characters and that there are at most 300 words in the file. 
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■ char word[MAXWORD]; /* work space V J 

char *w[N]; / an array of pointers / I 

We will use the array word to temporarily store each word that is read from the input 
file, and we expect each word to have less than MAXWORD characters. The declaration | 

char *w[N]; is equivalent to char ’-(wCN]); j 

Thus, w is an array of pointers to char. This array will eventually store all the words 
that we read from the input file. 

■ for (i = 0; scant("%s", word) == 1; ++i) { j 

• < > i * 

As long as scant() is able to read characters from the standard input hie and store 
them in word, the body of the for loop is executed. Recall that scantO takes as argu¬ 
ments a control string followed by addresses that are matched with the 
control string. Here, the second argument to scanfO is the array name y . 

Because an array name by itself is treated an address, we do not nee o wri e^ or 
Technically, it would be an error to do so. Some compilers will warn the progr 
about this kind of error; others will not. 

■ if (i >= N) 

error^exit_too„many_words(); 
if (strlen(word) >= MAXWORD) 
error__exi t_word_too_long(); 

If there are too many words, we print an error message and exit. If there a ^e toomany 
characters in the word read in by scanfO, we print an error message anc y 

ing error functions to do this, we reduce the clutter in mai n(). 

■ w[i] = calloc(strlen(word) + 1, sizeof(char)), 

The function calloc() is in the standard library, and its function prototype is in 
stdlib.h A function call such as 

calloc(n, sizeof(...)) 

dynamically allocates space for an array of n elements, with each element requiring 
si zeof (...) bytes in memory, and returns a pointer to the allocated space, e rrn 
mum number of bytes needed to store the string word, including the end-ol-stnng sen¬ 
tinel \0, is strlen(word) + 1. (The + 1 is important, and unfortunate y it is easy 
forget.) The function callocO dynamically allocates this space and returns a pointer 

to it. The pointer is assigned to w [i ]. 
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vVe could have invoked mallocO instead of callocQ. Either of the following state- 
Knents would have worked just as well: 

I w[i] = ma 1 1oc((strlen(word) + 1) * sizeof(char)); 

| w[i] = malloc((strlen(word) + 1)); 

&§Fi; 

The statements are equivalent because si zeof (char) has value 1. The reason for writ¬ 
ing it is to remind readers of the code that space for an array of chars is being 
(requested. With cal 1 oc (), the allocated space is initialized to zero, but with mai 1 oc () 
no initialization takes place. Because we are going to overwrite the space, initialization 
Hi is not needed. 

(§■ if (w[i] == NULL) 

jjlj error_exit_calloc_failed(); 

S|j$|N V' ' ' 

|y Good programming practice requires that we check the return value from callocO 
§§5 and mallocO. If the requested memory is unavailable, NULL is returned. If that is the 
if case, we print an error message and exit. 


HP:;- 
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■ strcpy(w[i], word); 

After space has been allocated, the function strcpyO is used to copy word into mem¬ 
ory starting at the address w [i ]. We can think of w as an array of words. 
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sort_words(w, n); 
wrt_words(w, n); 


/* sort the words */ 

/* write sorted list of words */ 


The function sort_words() is used to sort the words in the array w lexicographically. 
Then wrt_words() is used to write the sorted list of words on the screen. 


m -4 - - 













288 Chapter 6 ▼ Arrays, Pointers, and Strings 


6.13 T Arrays of Pointers 289 


Now we want to look at the function sort_words (). It uses a transposition sort that , s ;| 
is similar in flavor to a bubble sort. 

In file sort.c 


#include "sort.h" 
void sort_words(char *w[] 
i nt i, j ; 


int n) /* n elements are to be sorted */ 


} 


for (i =0; i < n; ++i) 

for (j = i +1; j < n; ++j) 
if (strcmp(w[i], w[j]) > 0) 
swap(&w[i], &w[j]); 


Notice that strcmpO is used to compare the two strings pointed to by w[i] and w[j]. 
If they are out of order, we use the function swap 0 to interchange the two pointer val¬ 
ues. The addresses of the pointers are passed so that the pointer values themselves can 
be changed in the calling environment by the function call. The underlying character 
strings in memory do not get swapped; only the pointers to them are interchanged. 


Before swapping 


W 


w[i] 


w[j] 
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After swapping 


W 


w[i] 


w[j] 



In file svvap.c 

#include "sort.h" 

void swap(char **p, char **q) 

I { 

char *tmp; 
tmp = *p; 
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*p = *q; 
*q = tmp; 


} 


Each of the two arguments &w[i ] and &w[ j ] that get passed to swapO is an address of 
a pointer to char, or equivalently, a pointer to pointer to char. Hence, the formal para¬ 
meters in the function definition of swapQ are of type char **. The swapO function 
is very similar to the one we wrote previously; only the types are different. Consider the 
statement 


tmp 


* 


p; 


Because p is a pointer to pointer to char, the expression *p that dereferences p is of 
type char *. Hence, we declared tmp to be of type char *. 

In file error.c 

#include "sort.h" 

voi d error_exit_cal1oc_fai1ed(void) 

{ 

printf("%s", 

"ERROR: The call to calloc() failed to\n" 

" allocate the requested memory - bye!\n"); 

exit(l); 

} 

void error_exit_too_many_words(void) 

{ 

pri ntf("ERROR: At most %d words can be sorted - bye!\n", N); 
exit(l); 

} 
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void error_exit_word_too_long(void) 

{ 

printf("%s%d%s", 

"ERROR: A word with more than ", MAXWORD, "\n" 
characters was found - bye!\n"); 

exit(l); 

} 

In file wrt.c 

#include "sort.h" 

void wrt_words(char *w[], int n) 

{ 

i nt i; 

for (i =0; i < n; ++i) 
printf("%s\n", w[i]); 


Of course, if the sort, words program were intended for serious work on a large amount 
of data, we would use a more efficient sorting algorithm. We could, for example, modify 
our mergeO and merge_sort() functions to work on arrays of pointers to char rather 
than on arrays of i nts. (See exercise 37, on page 325.) Also, we would want to count the 
words in the file and then allocate space for w[] dynamically, rather than using some 
arbitrarily large value N for the array size. (See Chapter 11, “Input/Output and the 
Oneratinp Svstpm 



/* Echoing the command line arguments 

#include <stdio.h> 

int mainfint argc, char *argv[]) 

{ 

int i: 


-..■-■■■jam 


6.14 Arguments to main() 

Two arguments, conventionally called argc and argv, can be used with mai n() to com¬ 
municate with the operating system. Here is a program that prints its command line 
arguments. It is a variant of the echo command in MS-DOS and UNIX. 







WZ&V-sfji 




mam 


printf("argc = %d\n", argc); 
for (i =0; i < argc; ++i) 
printf("argv[%dl = %s\n", 
return 0; 


C * n \ 

i]); 


The variable argc provides a count of the number of command line arguments. The 
array argv is an array of pointers to char that can be thought of as an array of strings. 
The strings are the words that make up the command line. Because the element 
argv[0] contains the name of the command itself, the value of argc is at least 1. 

Suppose we compile the above program and put the executable code in the file 
my.echo. If we give the command 

my .echo a is for apple 
the following is printed on the screen: 


argc = 

argv[0] 

argv[l] 

argv[2] 

argv[3] 

argv[4] 


my_echo 

i s 

for 

apple 


On an MS-DOS system, the string in argv[0] consists of the full pathname of the com¬ 
mand, and it is capitalized. As we will see in Section 11.12, “Environment Variables,” on 
page 521, file names are often passed as arguments to mai n(). 
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6.15 Ragged Arrays 


We want to contrast a two-dimensional array of type char with a one-dimensional array 
of pointers to char. Both similarities and differences exist between these two con- 

n j*M i i n j n 

.. ..* v* 1 ^* 9 

#include <stdio.h> 

int main(void) 

{ 

char a[2][15] = {"abc:", "a is for apple"}; 
char *p[2] = {"abc:", "a is for apple"}; 


} 


printf("%c%c%c %s %s\n", a[0][0], a[0][l], a 
printf("%c%c%c %s %s\n", p[0][0], p[0][1], p 
return 0; 


0] [2], a[0], a[l]); 
0] [2], p[0], p[l]); 


The output of this program is the following: 

abc abc: a is for apple 
abc abc: a is for apple 

The program and its output illustrate similarities in how the two constructs are used. 
Let us consider the program in some detail. 

The identifier a is a two-dimensional array, and its declaration causes space for 30 
chars to be allocated. The two-dimensional initializer is equivalent to 


{{'a', 'b\ ’ c ’, '\0'}, {’a', ' ' i', 's' 


■ }} 


ggfi 




m 


mm 


The identifier a is an array, each of whose elements is an array of 15 chars. Thus, a[0] 
and a[1] are arrays of 15 chars. Because arrays of characters arc strings, a[0] and 
a[l] are strings. The array a[0] is initialized to 

{'a', 'b', 'c', '\0 1 } 

and because only five elements are specified, the rest are initialized to zero (the null 
character). Even though not all elements are used in this program, space has been allo¬ 
cated for them. The compiler uses a storage mapping function to access a[i] [j]. Each 
access requires one multiplication and one addition. 


mm 


iff 
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The identifier p is a one-dimensional array of pointers to char. Its declaration causes 
space for two pointers to be allocated (4 bytes for each pointer on our machine). The 
element p[0] is initialized to point at "abc: ", a string that requires space for 5 chars. 
The element p[1] is initialized to point at "a is ... ", a string that requires space for 
15 chars, including the null character \0 at the end of the string. Thus, p does its work 
in less space than a. Moreover, the compiler does not generate code for a storage map¬ 
ping function to access p[i] [j], which means that p does its work faster than a. Note 
that a[0] [14] is a valid expression, but that p [0] [14] is not. The expression 
p [0] [14] overruns the bounds of the string pointed to by p[0]. Of course, a[0] [14] 
overruns the string currently stored in a[0], but it does not overrun the array a[0]. 
Hence, the expression a[0] [14] is acceptable. 

Another difference is that the strings pointed to by p [0] and p[l] are constant 
strings, and, hence, cannot be changed. In contrast to this, the strings pointed to by 
a[0] anda[l] are modifiable. 

An array of pointers whose elements are used to point to arrays of varying sizes is 
called a ragged array. Because, in the preceding program, the rows of p have different 
lengths, it is an example of a ragged array. If we think of the elements p[i] [j] 
arranged as a “rectangular” collection of elements in rows and columns, the disparate 
row lengths give the “rectangle” a ragged look. Hence the name “ragged array.” 

A ragged array 



a 

b 

c 

\0 


a 
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6.16 Functions as Arguments 

in C, pointers to functions can be passed as arguments, used in arrays, returned from 
functions, and so forth. In this section we describe how this facility works. 

Suppose we want carry out a computation with a variety of functions. Consider 

n 
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m 
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where in one instance f(fc) = si n(k), and in another instance f(k~) = 1 / k. The fol¬ 
lowing routine accomplishes the task: 

In file sum_sqr.c 

finclude "sum_sqr.h" j 

double sum__square (double f(double x), int m, int n) 

{ 

int k; 
double sum = 0.0; 

for (k = m; k <= n; ++k) 
sum += f(k) * f(k); 
return sum; 

} 

In the header to the function definition, the first parameter declaration tells the com¬ 
piler that f is a function that takes an argument of type doubl e and returns a double. 
The identifier x is for the human reader; the compiler disregards it. Thus, we could 
have written 

double sum_square(double f(double), int m, int n) 

{ 

* « » * * 

When a function occurs in a parameter declaration, the compiler interprets it as a 
pointer. Here is an equivalent header to the function definition: 

double sum_square(double (*f)(double), int m, int n) 

{ 

« « * a « 

We read the first parameter declaration as “f is a pointer to function that takes a single 
argument of type double and returns a double.” The parentheses are necessary 
because () binds tighter than *. In contrast, consider the declaration 

double *g(double); 


This declares g to be a function that takes an argument of type doubl e and returns a 
pointer to double. 
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| In the body of the function definition for sum_square(), we can either treat the 
If pointer f as if it were a function, or we can explicitly dereference the pointer. For exam- 
jf pie, we could have written 


sum += (*f)(k) * (*f)(k); 


instead of 


sum += f(k) * f(k) 


If It is helpful to think of the construct (*f) (k) as follows: 


f 

rt-p 

(*f)(k) 


the pointer to a function 
the function itself 
the call to the function 


J To illustrate how the function sum_square() might be used, let us write a complete 
if program. In main(), we will use sin() from the mathematics library and the function 
| f (), which we will write ourselves. 


In file sum_sqr.h 


#include <math.h> 
#include <stdio.h> 


double f(double x); 

double sum_square(double f(double x), int m, int n) 


In file main.c 


#include "sum„sqr.h" 


int main (void) 

{ 

printf("%s%.7f\n%s%.7f\n", 

" First computation: ", sum_square(f, 1, 10000) 
"Second computation: ", sum_square(sin, 2, 13)) 
return 0; 


In file fct.c 


#include "sunusqr.h" 


double f(double x) 

{ 

return 1.0 / x; 
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The output of this program is 

First computation: 1.6448341 
Second computation: 5.7577885 




.;.IjglSlI§ 
Mm 


6.17 r An Example: Using Bisection to Find the Root of a Function 297 


In mathematics it is known that the sum of 1 /k l from 1 to infinity is 7t 2 /6. Notice that 
the first number in the output of the program approximates this. 


Functions as Formal Parameters in Function Prototypes 

There are a number of equivalent ways to write a function prototype that has a function 
as a formal parameter. To illustrate this, let us wile a list of equivalent function proto¬ 
types for sum_sqr(): 



jggj Suppose that f is a continuous real-valued function defined on the interval |o, b]. If 
and f(b) are of opposite sign, then the continuity of the function guarantees that it 
has a root in the interval [a, b]. 



double 
double 
double 
double 
double 
double 


sum_square(double f(double x), int m, int n); 
sum_square(double f(double), int m, int n); 
sum__square(double f(double), int, int); 
sum_square(double (*f)(double), int, int); 
sum_square(double (*)(double), int, int); 
sum_square(double g(double y), int a, int b); 
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In the above list, the identifiers m and n are optional; if present, the compiler disregards 
them. In a similar fashion, the identifier f in the construct (*f ) is optional. 

In C, a function can have multiple prototypes, provided that they are all equivalent. 
Thus, when we put the above list in the header file sum_sqr,h, our program compiles 
and runs just as before. 



-4C" 


■■N — 


' " ", •• ■•.. .- 


........... _, iS 





6.1 7 An Example: Using Bisection to Find the Root 
of a Function 

An important problem that arises in engineering, mathematics, and physics is to find 
the root of a given real-valued function. A real number x that satisfies the equation 
fix) = 0 is called a root of f. In simple cases—for example, if f is a quadratic polyno¬ 
mial—a formula for the roots is known. In general, however, there is no formula and a 
root must be found by numerical methods. 
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Finding a root by bisection 


graph of f 


f(a) 


root 

/ 


midpoint 


f(m) 


fib) 


Notice that the condition that fid) and fib) have opposite sign is equivalent to the prod¬ 
uct fid) x fib) being negative. The method of bisection proceeds as follows. Let m be 
the midpoint of the interval. If fim) is zero, we have found a root. If not, then either f{a) 
and f{m) are of opposite sign or fim) and f(b) are of opposite sign. Suppose that the 
first case holds. We then know that f has a root in the interval [a, m ], and we now start 
the process over again. After each iteration, we obtain an interval that is half the length 
of the previous interval. When the interval is sufficiently small, we take its midpoint as 
an approximation to a root of f. In general, we cannot hope to find the exact root. For 
most functions the precise mathematical root will not have an exact machine represen¬ 
tation. Here is a program that finds an approximate root of the polynomial: 

x 3 - 7 x - 3 

In file find_root.h 

#include <assert.h> 

#include <stdio.h> 

typedef double dbl; 
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extern int cnt; 

extern const dbl eps; /* epsilon, a small quantity */ 

dial bi secti on(dbl f(dbl x) , dbl a, dbl b) ; 
dbl f(dbl x); 

In file main.e 

/ K Find a root of f() by the bisection method. */ 

#include "find_root.h" 
int cnt = 0 ; 

const dbl eps = le-13; /* epsilon, a small quantity */ 

int main(void) 

{ 

dbl a = - 10 . 0 ; 
dbl b = + 10 . 0 ; 
dbl root; 

assert(f(a) * f(b) <= 0 . 0 ); 

r 0 ° t 4 .£^uo/ S ^ c lP 0n ^’ a ’ k); /* recursive fct call */ 

pnntf( %s%d\n%s% .15f\n%s% .15f\n", 

"No. of fct calls: ", cnt, 

"Approximate root: ", root, 

Function value: ", f(root)); 
return 0 ; 

} 

In file bisections 

#include "find_root.h" 

dbl bisection(dbl f(dbl x), dbl a, dbl b) 

{ 

dbl m = (a + b) / 2 . 0 ; /* midpoint */ 

x /* # of fct calls */ 

if (f(m) == 0.0 || b - a < eps) 
return m; 

else if (f(a) * f(m) < 0 . 0 ) 

return bisection(f, a, m); 
else 

return bisection(f, m, b); 

} 
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In file fct.c 


#include "find_root.h" 


dbl f(dbl x) 

{ 

return (x * x * x * x * x 

} 


7.0 * x 


3.0); 


In mathematics, the Greek letter "epsilon,” written e, is often used to represent a small 
positive quantity (number). In mathematical programming, the identifier eps is often 
used to play the role analogous to e in mathematics. 

In niain(), a has value -10 and b has value +10. Thus, the interval [a, bj is the same 
as the interval [-10, +10J. Clearly, f (-10) is negative and f (+10) is positive. That is, f 
takes on values having opposite signs at the end points of the interval [a, b], which 
means that the bisection method can be used to find an approximate root of f. It so 
happens that f has three real roots and a pair of complex roots. The bisection method 
will only find a real root, and which one depends on the interval [a, bj initially chosen. 
Here is the output from the program: 


No. of fct calls: 49 
Approximate root: 1.719628091484431 
Function value: -0.000000000000977 


If we use le-15 instead of le-13 as the value for eps in mai n(), then the function 
value printed out would be zero. We did not use le-15 in our program, because we did 
not want to give the reader the impression that an actual root was found. If we change 
our program to use long doubles instead of doubles, change the value of eps from 
le-13 to le-31, and change the formats % .16f in the printf () statement to % . 33f, 
then the following gets printed on our system: 


No. of fct calls; 109 

Approximate root: 1.719628091484457524492214890955915 
Function value: -0.000000000000000000000000000000555 


No matter how much work w'e do, we do not expect to get an exact decimal representa¬ 
tion of the root. Although we are not sure, the root is probably an irrational number. 

The blank space following the % in the formats causes a blank to be printed if the cor¬ 
responding argument is positive, but has no effect if the argument is negative. This pro¬ 
vides a mechanism for lining up positive and negative numbers. (See Section 11.1, “The 
Output Function pri ntf (),” on page 493, for details about formats.) 
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In file kepler.c 

#include "kepler.h" 

dbl kepler(dbl x) 

{ 

return (x - e * sin(x) 

} 


m); 


I he bisecti on () function is the same as the one we wrote Section 6.17, “An Example: 

Using Bisection to Find the Root of a Function,” on page 298. Here is the output of the 
program: 


No. of fct calls: 
Approximate root: 
Function value: 


2.499454528163501 
0.000000000000000 


6.18 Arrays of Pointers to Function 


In the kepler program that we discussed in the previous section, the bisection() 
function takes as its first argument a function, or more precisely, a pointer to function. 
In C, a function name by itself is treated by the compiler as a pointer to the function. 
(This is analogous to the idea that an array name by itself is treated as a pointer to the 
base of the array in memory.) Technically speaking, w'hen we pass kepler as an argu¬ 
ment to bisectionO, it is the address of keplerf) that gets passed. To make the 
pointer nature of the identifier kepler more evident, here is another way to write 
main() in the kepler program: 

int main(void) 

{ 


dbl a = -100.0; 

dbl b = +100.0; 

dbl root; 
dbl (*pfdd)(dbl); 

pfdd = kepler; 

assert(pfddCa) * pfdd(b) <= 0 . 0 ) 
root = bisection(pfdd, a, b); 


/* ptr to fct taking dbl 
and returning dbl */ 


/* recursive fct call */ 
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Jim iys#;: 

> | 


■: " 


„ 


Observe that pfdd is a pointer variable of type “pointer to function taking a single argu¬ 
ment of type double and returning a double.” Like other C pointers, pfdd is strongly 
typed and can only be assigned pointer values of the same type. Because kepi er () is a 
function taking a double and returning a double, we can assign kepler to pfdd. 
Because the type for pfdd matches the type of the first parameter in the function defi¬ 
nition for bi sectionO, we can pass pfdd as an argument to bisection(). 

In our next program we create an array of pointers, each of type “pointer to function 
taking a double and returning a double.” We then deal with each array element (func¬ 
tion pointer) in a for loop. Caution: In this program pfdd will be a type, not a pointer 
variable. 






V 


In file find_roots.h 


if! ■■ 


#include <assert.h> 
#include <math.h> 
#include <stdio.h> 




#define 
typedef 

j iV 


N 4 


/* size of array of ptrs to fcts */ 


double dbl; 


/ 

// Create the type "ptr to fct taking a dbl and returning a dbl." 
*/ 

typedef dbl (*pfdd)(dbl); 


extern int cnt; 

extern const dbl eps; 


/* epsilon, a small quantity */ 


is 


dbl 

dbl 

dbl 

dbl 


bisection(pfdd f, dbl a, dbl b) 
flCdbl x); 
f2(dbl x); 
f3(dbl x); 
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In file main.c 


/* Use bisection to find roots. */ 
#include "find_roots.h" 


int cnt = 0; 

const dbl eps = le-13; 

int main(void) 

{ 


/* epsilon, a small quantity 


int begin_cnt; 

int i; 

int nfct_calls; 

dbl a = -100.0; 

dbl b = +100.0; 

dbl root; 

dbl val; 

pfdd f[N] = {NULL, fl, f2, f 3} ; 

for (i =1; i < N; ++i) { 

assert(f [i] (a) * f[i](b) <= 0.0); 
begin_cnt = cnt; 
root = bisection(f[i], a, b); 
nfct_calls = cnt - begin_cnt; 
val = f[i](root); 

printf("%s%d%s% .15f\n%s%d%s% .15f\n%s%3d\n\n", 
"For f[", i, "] (x) an approximate root is x0 
Fct evaluation at the root: f[", i, "](x0) 


root 
val. 


Number of fct calls to bisectionQ =", nfct_calls) 


return 0 


In file bisections 


#include "find_roots.h" 

dbl bisection(pfdd f, dbl a, dbl b) 

{ 

dbl m = (a + b) / 2.0; /* midpoint */ 


i t * 

'**♦¥* **••*» I lit 

i 1 V H L ) 

if (f(m) ==0.0 || b - a < eps) 
return m; 

else if (f(a) * f(m) < 0.0) 
return bisection(f, a, m); 

p] Cp 

V** 1 >Wh»» 

return bisection(f, m, b); 


/* # of fct calls */ 







m?: v- 
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dbl fl(dbl x) 

{ 

return (x*x*x - x*x + 2,0*x 

} 


- 2 . 0 ); 


dbl f2(dbl x) 

{ 

return (sin(x) - 0.7*x*x*x + 3.0); 

} 
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dbl f3(dbl x) 

{ 

return (exp(0.13*x) - x*x*x); 

} ' 

When we execute this program, here is what gets printed: 


For f[l](x) an approximate root is x0 
Fct evaluation at the root: f[l](x0) 
Number of fct calls to bisectionQ 

For f[2](x) an approximate root is x0 
Fct evaluation at the root: f[2](x0) 
Number of fct calls to bisectionQ 

For f[3](x) an approximate root is x0 
Fct evaluation at the root: f[3](x0) 
Number of fct calls to bisectionQ 


1.000000000000023 
0.000000000000069 
5 2 

1.784414278382185 

0.000000000000169 

C J 

1.046387173807117 

-0.000000000000134 

52 





Dissection of the find_roots Program 


/ 

// Create the type "ptr to fct taking a dbl and returning a dbl." 

V 

typedef dbl (*pfdd)(dbl); 


This typedef makes pfdd a new name for the type “pointer to function taking a single 
argument of type double and returning a double.” We will see that the use of pfdd 
makes the code easier to write and easier to read. 
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dbl bisection(pfdd f, dbl a, dbl b); 


: . • 

«s 


This is the function prototype for bisectionC). Because pfdd is a typedef, the paramll 
eter declaration pfdd f is equivalent to what one gets by substituting f for pfdd in the 
typedef itself. Thus, pfdd f is equivalent to dbl (*f) (dbl), and two other equivalent 
function prototypes for bisectionC) are 


dbl bisection(dbl f(dbl), dbl a, dbl b); 
dbl bisection(dbl f(dbl x), dbl a, dbl b) 


• • 


-• 


. - : 


Although these are longer, some programmers prefer them. Any of these styles is 
acceptable; 




■ pfdd f[N] = {NULL, fl, f2, f3}; 

We declare f to be an array of N elements, with each element having type pfdd. The 
array is initialized with four pointer values, the first being NULL. The initialization 
causes f [0] to be assigned NULL, f [1] to be assigned to fl, f [2] assigned to f2, and 
so forth. We will not use f [0] . The following declarations are equivalent: 

a pfdd f[N] = {NULL, fl, f2, f3}; 
pfdd f[N] = {NULL, &fl, &f2, &f3}; 
dbl (*f[N])(dbl) = {NULL, fl, f2, f3>; 

dbl (*f[N])(dbl) = {NULL, &fl, &f2, &f3}; 

We can choose any one from this list to use as our declaration of f . A function name by 
itself, such as fl, can be thought of as a pointer. But we may also think of it as the 
name of a function with &f 1 being a pointer to the function. ANSI C allows ambiguous 
thinking in this matter. 

H root = bisection(f[i] , a, b); 

Here, the pointer f [i] is being passed as an argument to bisectionC). Equivalently, 
we could have written 

root = bisection(*f [i], a, b); 

When we dereference the pointer, we get the function that it points to. But the compiler 
treats a function by itself as a pointer to the function, so the two ways of writing this 
are equivalent. The words “function by itself” mean that the function is not followed by 
parentheses. If a function name or a function pointer or a dereferenced function 
pointer is followed by parentheses, then we have a call to the function. 


T,v «iPj 


















mt: 


f [i ] (a) f[i](b) f [i ] (root) 

IjjlThese are calls to the function pointed to by f [i ]. Tf we wish, we can write 
(*f[i])(a) (*f[i])(b) (*f[i])(root) 


mmur 

mkP'-n ; ■ ; 
W&Mk y" : 

f|®!P:; 


ft* instead. The output shows that f [i] (root) for each i is close to zero, as it should be. 








jljNote that each of our functions fl(), f2(), and f 3 () happens to take on values with 
jjj opposite signs at the end points of the interval [-100, 100]. Our calls to bisectionC) 
If would not work if this condition does not hold. 
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6.19 The Type Qualifiers const and volatile 

The keywords const and vol ati 1 e have been added to the C language by the ANSI C 
committee. These keywords are not available in traditional C. They are called type qual¬ 
ifiers because they restrict, or qualify, the way an identifier of a given type can be used. 

Let us first discuss how const is used. Typically, in a declaration const comes after 
the storage class, if any, but before the type. Consider the declaration 




static const int 


k 




ii 

if?,. 

ifp 

■a 

ffff- 


iitf 
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m 




mm 

.v 
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We read tliis as “k is a constant i nt with stati c storage class.” Because the type for k 
has been qualified by const, we can initialize k, but thereafter k cannot be assigned to, 
incremented, decremented, or otherwise modified. 

In C, even though a variable has been qualified with const, it still cannot be used to 
specify an array size in another declaration. In C++, however, it can be used for this 
purpose. This is one of the places where C and C++ differ. 


MP 

nil 






c 

mm 


const int 
i nt 


n = 3; 
v[n]; 


/* any C compiler should complain */ 






In some situations we can use a const-qualified variable instead of a symbolic con¬ 
stant; in other situations, we cannot. 

An unqualified pointer should not be assigned the address of a const-qualified vari¬ 
able. The following code illustrates the problem: 
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■m^mm 

■ r 


const int 
i nt 


a = 7; 

*p = &a; 


/* the compiler will complain */ 


Here is the reason why the compiler complains. Since p is an ordinary pointer to im 
we could use it later in an expression such as ++*P to change the stored va ue o a,v^ 
lating the concept that a is constant. If, however, we write ,Jj 








W§0> 

J§ 2 &> :■&/ 

.y; 

Jig 
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const int 
const int 


7- 

: &a 


aa*«jlB l 1 I „ 

. | S3t r i 

• • 

H 


v">V., /.V . ."1 . i 

■ s S-, :i! 



then the compiler will be happy. The last declaration is read p P 

stant i nt and its initial value is the address of a.” Note that p itself is notco t.¥g, 

can assign to it some other address. We may not, however, assign a valu p. Thg 

object pointed to by p should not be modified. . ■ 

Suppose we want p itself to be constant, but not a. This is ac ieve wi e o ow m|| 

declarations: 







V - - .. 


qmmmm 'Wtm* %- 
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int 
i nt 


const 


a; 

p = &a 



wmm 


We read the last declaration as “p is a constant pointer to int, and 

address of a.” Thereafter, we may not assign a value to p, but we may assign a v u to 

*p. Now consider § 


" v * 



ssk&sxZ/zztZtSF 





const int 
const int * 


const 


a = 7; 
p = &a 


0->' • • '- ; . 



The last declaration tells the compiler that p is a constant pointer to a constant i nt. 

Neither p nor *p can be assigned to, incremented, or decremented. 

In contrast to const, the type qualifier vol ati 1 e is seldom ^ se ^; V ^,^ ) ^ c ^ 
is one that can be modified in some unspecified way by the hardw 
declaration 

extern const volatile int real_time_clock, 

The extern means “look for it elsewhere, either in this file or in some ® e '”^ 
qualifier vol ati 1 e indicates that the object may be acted on b V todwar . 
const is also a qualifier, the object may not be assigned to, mcrem , d 
mented within the program. The hardware can change the clock, but the code 
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Summary 

^-•Tr 

UK 

#§1 The brackets [] are used in a declaration to tell the compiler that an identifier is an 
array. The integral constant expression in the brackets specifies the size of the 
jjfb array. For example, the declaration 

(Jr 

§f int a[100]; 

W-ddh-VT .. . 

^4%, ! 

gp 

11 causes the compiler to allocate contiguous space in memory for 100 ints. The ele* 
■l ments of the array are numbered from 0 to 99. The array name a by itself is a con¬ 
s' stant pointer; its value is the base address of the array. 

in 

m 

32 A pointer variable takes addresses as values. Some typical values are NULL, 
||; addresses of variables, string constants, and pointer values, or addresses, returned 
3 from functions such as cal1oc(). If allocation fails—for example, the system free 
p| store (heap) is exhausted—then NULL is returned. 

te- 

:JfC- ■ 

v 3 The address operator & and the indirection or dereferencing operator * are unary 
?!' operators with the same precedence and right to left associativity as other unary- 
operators. If v is a variable, then the expression 


K. 


*&v 


is equivalent to 


Pointers are used as formal parameters in headers to function definitions to effect 
“call-by-reference.” When addresses of variables are passed as arguments, they can 
be dereferenced in the body of the function to change the values of variables in the 
calling environment. 

In C, arrays and pointers are closely related topics. If a is an array and 1 is an i nt, 
then the expression 


a[i] 


is equivalent to 


*(a + i) 


These expressions can be used to access elements of the array. The expression a + 
i is an example of pointer arithmetic. Its value is the address of the element of the 
array that is i elements beyond a itself. That is, a + i is equivalent to &a [i ]. 
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ft 1 Fo 1 ur values fi e t Printed when the following code is executed. How many of those 
|§. ( values are the same? Explain. 5 
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char 

int 

int 


*format = "%p %d %d %d\n" 
i = 3; 

*p = &i; 


printf(format, p, *p + 7 , 3 * **&p + 1, 5 * (p 


2 ))) 


2 0n f ° f ° U J cornpilers warned us about integer overflow for the expression p - fn 
- 2) m the previous exercise. Modify the program that you wrote in the previous 
exercise so that it prims the integer values of both p and p - 2. Does it seem possh 
ble that mteger overflow can occur? (See the next exercise for further discussion.) 

3 Consider the following program: 

#include <stdio,h> 

#include <stddef.h> 

int main(void) 

int a, b, *p t= &a, *q = &b: 

ptrdiff_t diff = p - q; 

printf("diff == %d\n", diff); 
return 0; 




n^ S S“ renCe ? tW ° pmnter sessions must be a signed integral type. 

Innn n ^ ’ the tyP6 18 1 nt ’ and on 1110st MS-DOS systems the type is 

long. On all ANSI C systems, the type is given in the standard header file stddefh 
by a type definition of the following form: 

typedef type ptrdiff_t; 

^ ty f 3edeP “ stdde f- h on your system so that you will know the type for 

1 I u N V, hal %d 1S appropriate in the P ri ntf O statement if di ff has type i nt 
and that XId is appropriate if di ff has type long. Run the program so that vou 

un erstand its effects. Then modify the program by adding the following two lines: 
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wmm 
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diff = p - (int *) 0; 
printfC'diff = %d\n", diff); 




.. ■ 1 


Hi 






u?m 

i# 




wmm 


mm 


Are you surprised by what gets printed now? Do you understand the reason for the|f||"~ 
compiler warning that was discussed in exercise 2, on page 311? Explain. If i nt * is J 
replaced by ptrdi ff_t *, does the program act any differently? JJgjj 
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mm 
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4 If i and j are i nts and p and q are pointers to i nt, which of the following assign¬ 
ment expressions are not legal? 


■SIM 



&i 

(*&)j 


p = &*&i 

i = *&*&j 


i = (int) p q = &p *q = &j 
i = *p++ + *q 


£v‘ 


Wn 
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5 When variables are declared, are they located in memory contiguously? Write a pro¬ 
gram with the declaration 
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char a, b, c 


. V P, 


*q, *r 
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and print out the locations that are assigned to all these variables by your compiler.!!! 
Are the locations in order? If the locations are in order, are they increasing or ! 
decreasing? Is the address of each pointer variable divisible by 4? If so, this proba-~?fr 
bly means that each pointer value gets stored in a machine word. ‘j 

6 The following program uses %p formats to print out some addresses: |j 

#include <stdio.h> 


"0%; 




St 






int main(void) 

{ 

int a = 1, b 


2, c 



mv' 

iljpfA 




printf("%s%p\n%s%p\n%s%p\n", 
"&a = ", 4a, 

"&b = ", &b, 

"&c = ", &c); 
return 0; 


If the variables a, b, and c are not initialized, does the program produce the same 
output? What happens on your system if you change %p to %d? Does your compiler 
complain? (It should.) If possible, run your program on an MS-DOS system. Because 
a pointer is 4 bytes and an i nt is 2 bytes, the %d format is inappropriate and can 
cause a negative number to be printed. 
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If you want to see addresses printed as decimal numbers rather than hexadecimals, 
it is usually safe to cast an address as an unsi gned 1 ong and use the %1 u format 
Try this on your system by replacing the pri ntf () statement in exercise 6, on page 
312, by 

printf("%s%lu\n%s%lu\n%s%lu\n", 

"&a = ", (unsigned long) &a, 

"&b = ", (unsigned long) &b, 

"&c = ", (unsigned long) &c); 

What gets printed? Explain. 

#include <stdio.h> 

typedef unsigned long ulong; 

int main(void) 

{ 

char *pc = NULL; 

int *pi = NULL; 

double *pd = NULL; 

long double *pld = NULL; 

printf("%51u%51u\n%51u%5lu\n%51u%51u\n", 

(ulong)(pc + 1), (ulong)(pi + l), 

(ulong)(pd + 1), (ulong)(pld + 1), 

(ulong)(pc + 3), (ulong)(pld + 3)); 
return 0; 

} 

The following array declarations have several errors. Identify each of them. 

#define N 4 

int a[N] = {0, 2, 2, 3, 4}; 
int b[N - 5]; 
int c[3.0]; 
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10 In the following program the invocation of change_i t() seems to have no effeq||j 
Explain. 
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#include <stdio.h> 

void change_it(int []); 

int main(void) 

{ 

int a[5], *p; 
p = a; 

printf("p has the value %p\n", p); 
change_it(a); 
p = a; 

printf("p has the value %p\n", p); 
return 0; 


void change_it(int a[]) 

{ 

int i = 777, *q = &i; . 

a = q; /* a is assigned a different value V 

} 
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11 what is wrong with the following program? Correct the program and explain th^B 
meaning of its output. Illl 

#include <stdio.h> S 

int main(void) 

{ ; il 

int a[] = {0, 2, 4, 6, 8}, ffj| 

*p = a + 3; 1 


1 mi 
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printf("%s%d%s\n%s%d%s\n", 
"a[?] = ", *p, "?'\ 

"a[?+l] = ", *p + 1, "?") 
return 0; 


^#1 


12 A real polynomial p(x) of degree n or less is given by fjj 

p(x) — a 0 + a x x -r a 2 x ' + • • ■ +- a n x .|j| 

with the coefficients a 0 , a x .a„ representing real numbers. If an != 0, then the j 

degree of p(x) is n. Polynomials can be represented in a machine by an array such as ^ 


feff 


▼ Exercises 315 

#define N 5 /* N is the max degree */ 

double p[N + 1]; 

Write a function 

double eval(double p[], double x, int n) /* n is max degree */ 

{ 

m % m * i 

that returns the value of the polynomial p evaluated at x. Write two versions of the 
function. The first version should be written with a straightforward naive approach. 
The second version should incorporate Horner’s Rule. For fifth-degree polynomials, 
Horner’s Rule is expressed by writing 

p(x) =a 0 + x(a x + (a 2 + x(a 3 +x(a 4 +x(a 5 ))))) 

How many additions and multiplications are used in each of your two versions of 
the eval () function? 

13 Write a function that adds two polynomials of at most degree n. 

/* f = g + h; n is the max degree of f, g, and h */ 

void add(double f[], double g[], double h[], int n) 

{ 

ft MS * * 9 

14 Write an algorithm to multiply two polynomials of at most degree n. Use your func¬ 
tion add() to sum intermediate results. This is not very efficient. Can you write a 
better routine? 

15 Modify the function bubble() so that it terminates after the first pass in which no 
two elements are interchanged. 

16 Modify mergesortO so that it can be used with an array of any size, not just with a 
size that is a power of 2. Recall that any positive integer can be expressed as a sum 
of powers of two—for example, 

27 ~ 16 +8+2+1 


Consider the array as a collection of subarrays of sizes that are powers of 2. Sort 
the subarrays, and then use mergeQ to produce the final sorted array. 
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, i tr.rrt QVTM*Dccinnc ni-f riticl (*d)++ have different effects, 

17 If p is a pointer, then the two expressions p++ emu c 

With the following code, what gets printed? Explain. 


wm.' . 


KiSfl; 
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char 

char 

int 


a[] = "abc" 


«p; 

* 

i; 


P _ ■ 

for (i =0; i <3; ++i) 
printf("XcXn", *p++); 
printf("a = %s\n M , a); 
p = a; 

for (i =0; i <3; ++i) 
pri ntf ( T, %c\n H , (*p)++); 
printf( n a = %s\n", a); 

18 I ook carefully at the function definition for strcpyO given in Section 6.11, “Strmg- 

Handling Functions in the Standard Libi aiy, on page , 
sense to copy the tail end of a string onto its beginning. With the following code, 

what gets printed? Explain. 

char a[] = "abedefghijklmnopqrstuvwxyz"; 
char *p = a; 

char *q = a + strlen(a) - 3; 

printf("a = %s\n", a) ; 

strcpyCp, q); 

printf("a = %s\n", a); 

Explain what would go wrong if we were to interchange p and q in the call to 
strcpyO. 

19 Write a program that will test the relative: efficiency_of’the ^ 

in Section 6.7, “An Example: Bubble Sort,” on page 257, versus the function merg 

^ ^ . f1lo „, mrin , K pvprrise Generate test data by using randO 

sortO that you wrote m the previous exeicise. vreueicuc * 

to fill arrays. Run your program on arrays of various sizes, say wi ’ ’ 

and 1,000 dements. Plot the running time for each sort versus the size of the: array. 

For large array sizes, you should see the growth indicated by t e ormu as S 

the text. For small array sizes, there is too much overhead to detect this gro P 

tern. If you are unfamiliar with how to time program execution, see Section 11.16 

“How to Time C Code,” on page 528. If you are on a UNIX system, you can give 

following command to time a program: 
time pgm 
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▼ Exercises 317 


20 A palindrome is a string that reads the same both forward and backward. Some 
examples are 

"ABCBA" "123343321" "otto" "i am ma i" "C" 

Write a function that takes a string as an argument and returns the i nt value 1 if 
the string is a palindrome and returns 0 otherwise. If UNIX is available to you, how 
many palindromes can you find in the file / usr/dict/words ? 

21 Modify your palindrome function from the previous so that blanks and capitals are 
ignored in the matching process. Under these rules, the following are examples of 
palindromes: 

"Anna" "A man a plan a canal Panama" "ott o" 

If UNIX is available to you, how many more palindromes can you find in the file / 
usr/dict/words? 

22 What gets printed? Explain. 

printf("%c%c%c%c%c!\n", 

"ghi"[1], *("def" + l), 

*"abc" + 11, "klm"[1], *"ghi" + 8); 

23 In Section 3.4, “The Data Type i nt,” on page 116, we saw that the largest value that 
can be stored in a 1 ong i nt is approximately 2 billion. In many applications, such 
numbers are not big enough. For example, the federal government has to deal with 
figures in the trillions of dollars. (Or is it quadrillions?) In this exercise, we want to 
explore, in a primitive fashion, how two large integers can be added. Here is a pro¬ 
gram that will do this: 


#include 

<assert.h> 



#include 

<stdio.h> 



#define 

N 20 

/* size of all 

arrays */ 

typedef 

const char cchr; 



void add(int sum[], int a[], 

void wrt(cchr *s, int a[]); 

int b[] ) ; /* sum = 

= a + b */ 
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int main(void) 

{ 

int a[N] = {7, 5, 9, 8, 9, 7, 5, 0, 0, 9, 9, 0, 

int b[N] = {7, 7, 5, 3, 1, 2, 8, 8, 9, 6, 7, 7}; 

int sum[N]; 
int ndigits; 

add(sum, a, b); 
wrt("Integer a: ", a); 

wrt("Integer b: ", b); 

wrt(" Sum: ", sum); 

return 0; 


void add (int sum[], int a[], 

int carry = 0; 
int i; 


int b[]) 


for (i = 0; i < N; ++i) { 

sum[i] = a[i] + b[i] + carry; 
if (sum[i] < 10) 
carry = 0; 
else { 

carry = sum[i] / 10; 
sum[i] %= 10; 

} 

} 

} 


void wrt(cchr *s, int a[]) 

{ 

int i; 

printf("%s", s); 

/* 

// Print leading zeros as blanks. 

*/ 

for (i = N - 1; i > 0 && a[i] == 0; --i) 
putchar(’ ’); 

/* 

// After a leading digit greater than zero is found, 
// print all the remaining digits, including zeros. 
V 

for ( ; i >= 0; —i) 
printf("%d", a[i]); 
putchar('\n’); 



} 


▼ Exercises 319 


When we execute this program, here is what appears on the screen: 

Integer a: 88099005798957 

Integer b: 776988213577 

Sum: 88875994012534 

Note that the digits are stored in array elements going from element 0 to element 
N - 1, but that the digits are printed in the opposite order. To understand this pro¬ 
gram, review how you learned to do addition in grade school. Write a similar pro- 
, gram that computes the product of two integers. 

24 The sizeof operator can be used to find the number of bytes needed to store a 
type or an expression. When applied to arrays, it does not yield the size of the array. 
What gets printed? Explain. 

#include <stdio.h> 

void f(int a[]) ; 

int main(void) 

{ 

char s[] = "deep in the heart of texas"; 

char *p = "deep in the heart of texas"; 

int a [3]; 

double d[5]; 

pri ntf("%s%d\n%s%d\n%s%d\n%s%d\n", 

"sizeof(s) = ", sizeof(s), 

"sizeof(p) = ", sizeof(p), 

"sizeof(a) = ", sizeof(a), 

"sizeof(d) = ", sizeof(d)); 
f (a) ; 
return 0; 

} 

void f(int a[]) 

{ 

printf("In f(): sizeof(a) = %d\n", sizeof(a)); 

} 

25 If UNIX is available to you and you are familiar with the diff utility, try the following 
experiment. Use the two statements 

printf("abc\n"); and printf("a%cb%cc\n", ’\0', '\0'); 

to write two versions of an elementary program that writes on the screen. Use redi¬ 
rection to put the printout of the respective programs into two files, say impl and 
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T Exercises 323 


31 What gets printed? Explain. 

#include <stdio.h> 

void try_me(int [][3]); 

int main(void) 

{ 

int a[3][3] = {{2, 5, 7}, {0, -1, -2}, {7, 9, 3}}; 

try_me(a); 
return 0; 

} 

void try_me(int C*a)[3]) 

{ 

printf("%d %d %d %d ... infinity\n", 
a[l][0], -a[l] [1], a[0] [0], a[2][2]); 

} 

Now, change the declaration of the parameter in the header of the function defini¬ 
tion of try_meO to 

int *a[3] 

and leave the rest of the code alone. Does your compiler complain? (It should.) 
Explain. 

32 Choose a character and use a two-dimensional array that matches the size of your 
screen to graph on the screen the functions si n() and cos() from 0 to 2k. Because, 
on most screens, the space in which a character is printed is not square, there is 
horizontal/vertical distortion. Experiment with your graphs to see if you can 
remove this distortion. 

33 Write out a dissection for the following program. An understanding of the storage 
mapping function is needed to explain it. A complete explanation of the last 
pri ntf () statement is rather technical and should be attempted only by advanced 
computer science students. 






#include <stdio.h> 
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a[i][j]); 


int main(void) 

{ 

int a[3][5], i, j, 

*P = *a; /* a nice initialization! */ 

for (i =0; i <3; ++i) 
for (j =0; i < 5: ++i) 
a[i] [j] = i * 5 + j; 
for (i =0; i <3; ++i) 
for (j = 0; j < 5; ++j) 

printf("%s%12d", (j == 0) ? "\n" 
printf("\n"); 
for (i =0; i <15; ++i) 

printf("%s%12d", (i % 5 == 0) ? "\n" : *(p + i)) ; 

printf("\n\n% 12 d% 12 d\n% 12 d% 12 d\n% 12 d% 12 d\n% 12 d% 12 d\n\n", 

**a, **(a +1), 

*(a[0] + 1), *(*a + 1), 

*(a[l] + 2), *(*(a +1) +2), 

*(a[2] + 3), *(*(a + 2) + 3)); 
printf("%-lls%s%12d\n%-lls%s%12d\n%-lls%s%12d\n\n", 

"(int) a", (int) a, 

"(int) *a", "=", (int) *a, 

"(int) **a", "=", (int) **a); 
return 0; 

> 

34 Modify the my_echo program in Section 6.14, “Arguments to mai n (),” on page 291, 
so that it will print out its arguments in capital letters if the option -c is present. Do 
not print out the argument that contains the option. 
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35 Complete the following table: 


Declarations and initialization 


p[2][3] 


Expression 



*(p[l][2] + 2) 


= { abc", "d 
"pqrstuvw 


Equivalent expression 


p[0][0][0] 


Valu 


'a' 


/* error */ 



36 Simulations that involve the repeated use of a random-number generator to repro¬ 
duce a probabilistic event are called Monte Carlo simulations, so called because 
Monte Carlo has one of the world’s most famous gaming casinos. In this exercise we 
want to find the probability that at least two people in a room with n people have 
birthdays that fall on the same day of the year. Assume that there are 365 days in a 
year, and assume that the chance of a person being born on each day of the year is 
the same. A single trial experiment consists of filling an array of size n with inte¬ 
gers that are randomly distributed from l to 365. If any two elements in the array 
have the same value, then we say that the trial is “true.” Thus, a true trial corre¬ 
sponds to the case when at least two people in the room were born on the same day 
of the year. Simulate the probability by running, say, 10,000 trials with n people in 
the room. Do this for n- 2,3,..., 100. You can use the expression 

rand() % 365 + 1 

to compute the day of birth for each person. (Use a better random-number genera¬ 
tor, such as 1 rand48(), if one is available to you.) The number of true trials divided 
by 10,000 is the computed simulated probability. What value of n yields the break¬ 
even point? That is, find the least n for which the probability is 1/2 or more. 
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37 Modify the function merge!) given in the tevr arm e ■, 

you wrote in exercise 16, on pL 11 5 „ ' 1 ™ C ' ,<m ""^esortO that 

than on arrays of i nts. Modify the son. words program to use these^ funedon r " 
both versions of the program on a large file The program that 11Q / hme 

should run much faster. Does it? P g that uses mer 9esort() 


38 The following code can be used to reverse the characters in a string: 

char * reverse(char *s) 

{ 

char *p, *q, tmp; 
int n; 


n = strlen(s); 

q = (n > 0) ?s + n- l 
for (p = s; p < q; ++p, 
tmp = *p; 

*p = *q; 

*q = tmp; 
return s; 


-q) { 


Test this function by writing a program that contains the following lines: 

rev b e C ?se f ?si;»r 0Pt ' rStU '' WXyZ " : 

Execute your program so that you understand itQ 'vu $ n 

is in memory a, the beginning of (he for loop »*a, 


b eld 


Explain why this picture is correct. Draw a series of simiinr 

the state of the memory after each iteration of the for loop. P ^ ^ descnbe 

33 Remembet that arge and arov are tvmraiit/ ncn/i ^ 

arge is the number of command imo ^ s ar ^ uments t0 ^ain(). Because 
the array argv” a rg c bn “ha, ™ a“T“ ,S ’ «“ "aigh, think tha, the size of 

las, element in the an-ay , the null potamr ^ *T S ‘ ze argc * '■ and 
command: pomtei ' Here 18 another version of the echo 
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#inc1ude <stdio.h> 

int main (int argc, char **argv) 

{ 

while (*argv != NULL) 

printf("%s ", *argv++); 

putchar('\n'); 
return 0; 

} 


■ ■ ■ ■■ ■ 
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Run the program so that you understand its effects. Give a detailed explanation of 
how it works. Hint: If a pointer to pointer to char is dereferenced, then the result is 
a pointer to char. Also, reread the dissection of the strcatO function in Section 
6.11, “String-Handling Functions in the Standard Library,” on page 276. 

40 In C, a function prototype can occur more than once. Moreover, equivalent function 
prototypes do not conflict. Modify the fimLroots program that we wrote in Section 
6.18, “Arrays of Pointers to Function,” on page 302, by replacing the function proto¬ 
type 

dbl bisection(dbl f(dbl x), dbl a, dbl b); 
with the following list of function prototypes: 


_1 




a; 


1 




(St ■ 








dbl bisectionCpfdd, dbl, dbl); 

dbl bisectionCpfdd f, dbl a, dbl b); 

dbl bisection(dbl (*)(dbl), dbl a, dbl b); 

dbl bisection(dbl (*f)(dbl), dbl a, dbl b); 

dbl bisection(dbl f(dbl), dbl a, dbl b); 

dbl bisection(dbl f(dbl x), dbl a, dbl b); 

Does your compiler complain? (It shouldn't.) 

41 In Section 6.18, “Arrays of Pointers to Function,” on page 302, we saw that the 
find^roots program made 49 calls to rootQ each time a root of a function was 
computed. Use a hand calculator to explain why precisely 49 calls were made. 
Hint: Most of the calls to rootQ cut the interval in half. 
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▼ Exercises 327 


42 Compile and execute the following program so that you understand its effects: 

#include <stdio.h> 

#include <string.h> 

void tell_ine(int f(const char *, const char *)); 

int main(void) 

{ 

tell_me(strcmp) ; 
tell_me(main); 
return 0; 

} 

void tell_me(int f(const char *, const char *)) 

{ 

if (f == strcmp) 

pri ntf ("Address of strcmpO : %p\n", f) ; 

ol CD 

C I 

printf("Function address: %p\n", f); 

} 

Because the pointer being passed in the second call to tell_me() has the wrong 
type, your compiler should complain. Does it? Modify the program by changing the 
pointer type throughout to the generic void* type. Does this make your compiler 
happy? 

43 (Advanced) The following program has an error in it: 

#include <stdio.h> 

#include <string.h> 

int main(void) 

{ 

char *pl = "abc", *p2 = "pacific sea"; 

printf("%s %s %s\n", pi, p2, strcat(pl, p2)); 
return 0; 

} 

\ 

On our system, what happens is compiler-dependent. With one of our compilers, 
the program exhibits a run-time error. With another compiler, we get the following 
written to the screen: 

abcpacific sea acific sea abcpacific sea 
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This output makes sense and tells us something about the compiler. What program¬ 
ming error did we make? What does the output tell us about our compiler? Which is 
the preferred behavior: a compiler that produces executable code that exhibits a 
run-time error, or a compiler that produces (sometimes) logically incorrect output? 


44 A function name by itself is treated by the compiler as a pointer. This is a general 
rule in C. Is the following code legal? 

#include <stdio.h> 

void f(void); 
void g(void); 
void h(void); 

int main(void) 

{ 

c*f)C); 

return 0; 

} 

void f(void) 

{ 

printf("Hello from f().\n"); 

(C(*g)))0; 

} 

void g(void) 

{ 

printf("Hello from g().\n"); 

(*(*(*h)))(); 

} 

void h(void) 

{ 

printf("Hello from h().\n"); 

} 


Write your answer before you try to compile the program. Your answer should be 
based on your general knowledge about how well a C compiler can be expected to 
follow a rule, even if the rule is applied in some strange way. 


45 A precondition for the root() solver to work is that f (a) and f (b) must have 
opposite signs. Use an assertion at the beginning of rootQ to check this condition. 


▼ Exercises 


329 


The ancient Egyptians wrote in hieroglyphics. In this system of writing, vowel 
sounds are not represented, only consonants. Is written English generally under¬ 
standable without vowels? To experiment, write a function i s_vowel () that 
returns 1 if a letter is a vowel and 0 otherwise. Use your function in a program that 
reads the standard input file and writes to the standard output file, deleting all 
vowels. Use redirection on a file containing some English text to test your program. 









Chapter 7 


There are two additional ways to represent discrete values: as bits and as elements in a 
finite set. In this chapter, we first discuss the bitwise operators. Even though expres¬ 
sions involving bitwise operators are explicitly system-dependent, they arc very useful. 
We illustrate their usefulness in packing and unpacking data. 

In the second half of the chapter, we discuss the enumeration types. Enumeration 
types are user-defined types that allow the programmer to name a finite set together 
with its elements, which are called enumerators. These types are defined and used by 
the programmer as the need arises. We illustrate much of this material by implement¬ 
ing a completely worked out interactive game program. 


7.1 Bitwise Operators and Expressions 


The bitwise operators act on integral expressions represented as strings of binary dig¬ 
its. These operators are explicitly system-dependent. We will restrict our discussion to 
machines having 8-bit bytes, 4-byte words, the two’s complement representation of 
integers, and ASCII character codes. 
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Bitwise operators 

Logical operators 

(unary) bitwise complement 



bitwise and 

& 


bitwise exclusive or 

A 


bitwise inclusive or 

1 

Shift operators 

left shift 

« 


right shift 

» 




- 




Like other operators, the bitwise operators have rules of precedence and associativity 
that determine how expressions involving them are evaluated. -ff 


Operators 

Associativity 

0 □ ++ ( postfix ) — ( postfix .) 

left to right 

++ ( prefix ) — (prefix ) ! - sizeof (type) + 

( unary) - (unary) & (address) * (dereference) 

right to left 

* / % 

. ••• .■ i 

left to right 

+ 

left to right 

« » 

left to right 

< <~ > >= 

left to right 


left to right 

& 

left to right 

A 

left to right 

i 

left to right 


left to right 

11 

left to right j 

? . 

« « 

right to left 

=: +— j 

= %= »= «= &= A- | =s 


, (comma operator) 

left to right 


f i'he operator ~ is unary; all the other bitwise operators are binary. They operate on inte- 
^- gral expressions. We will discuss each of the bitwise operators in detail. 




Bitwise Complement 


if: The operator ~ is called the one’s complement operator, or the bitwise complement oper- 
JHator. It inverts the bit string representation of its argument; the Os become Is, and the 
K Is become Os. Consider, for example, the declaration 


i nt 


70707 




The binary representation of a is 

Sir ? 

00000000 00000001 00010100 00110011 


mm 


The expression ~a is the bitwise complement of a, and this expression has the binary 
|§j? representation 

| 11111111 11111110 11101011 11001100 
f The i nt value of the expression ~a is -70708. 
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Two’s Complement 

The two’s complement representation of a nonnegative integer n is the bit string 
obtained by writing n in base 2. If we take the bitwise complement of the bit string and 
add 1 to it, we obtain the two’s complement representation of -n. The next table gives 
some examples. To save space, we show only the two low-order bytes. 
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Value 
of n 

Binary 

representation 

Bitwise complement 

Two’s complement 
representation of -n 

Value 
of -n 

7 

00000000 00000111 

11111111 11111000 

11111111 11111001 

-7 

8 

00000000 00001000 

11111111 11110111 

11111111 11111000 

-8 

9 

00000000 00001001 

11111111 11110110 

11111111 11110111 

-9 

-7 

liiiim nine©! 

00000000 00000110 

00000000 00000111 

t 






The preceding table is read from left to right. If we start with a positive integer n, 
consider its binary representation, and take its bitwise complement and add 1, then we 
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obtain the two’s complement representation of -n. A machine that uses the two’s com¬ 
plement representation as its binary representation in memory for integral values is 
called a two’s complement machine. 

On a two’s complement machine, if we start with the binary representation of a nega¬ 
tive number -n and take its bitwise complement and add 1, we obtain the two’s comple¬ 
ment representation, or binary representation, of n. This is illustrated in the last line in 
the preceding table. 

The two’s complement representations of both 0 and -1 are special. The value 0 has 
all bits off; the value -1 has all bits on. Note that if a binary string is added to its bitwise 
complement, then the result has all bits on, which is the two’s complement representa¬ 
tion of -1. Negative numbers are characterized by having the high bit on. 

On a two’s complement machine, the hardware that does addition and bitwise com¬ 
plementation can be used to implement subtraction. The operation a - b is the same 
as a + (-b), and -b is obtained by taking the bitwise complement of b and adding L. 
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Bitwise Binary Logical Operators 
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The three operators & (and), a (exclusive or), and 1 (inclusive or) are binary operators, af 
They take integral expressions as operands. The two operands, properly widened, are |J 
operated on bit position by bit position. The following table shows the bitwise opera- 
tors acting on 1-bit fields. The table defines the semantics of the operators. m. 




The next table contains examples of the bitwise operators acting on i nt variables, 









itwise Operators and Expressions 


Dec aration and initializations 


int a = 33333, b = -77777; 


Expression Representation 




a & b 


a a b 



~(a i b) 


(~a & ~b) 


00000000 00000000 10000010 00110101 


11111111 11111110 11010000 00101111 


00000000 00000000 10000000 00100101 


11111111 11111110 01010010 0001101 


11111111 11111110 11010010 00111111 


00000000 00000001 00101101 11000000 


00000000 00000001 00101101 11000000 


33333 



110054 


77248 


j Left and Right Shift Operators 

'; The two operands of a shift operator must be integral expressions. The integral prorno- 
§ tions are performed on each of the operands. The type of the expression as a whole is 
that of its promoted left operand. An expression of the form 

v i -A* 

« exprl « expr2 

p| causes the bit representation of exprl to be shifted to the left by the number of places 
I specified by expr2. On the low-order end, Os are shifted in. 


Declaration and initialization 


char c = ’Z 


* 7 I , 


Representation 



c « 4 


c « 31 


00000000 00000000 00000000 01011010 unshifted 

00000000 00000000 00000000 10110100 


00000000 00000000 00000101 10100000 left-shifted 4 




I; Even though c is stored in 1 byte, in an expression it gets promoted to an i nt. When 
|f shift expressions are evaluated, integral promotions are performed on the two oper- 
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ands separately, and the type of the result is that of the promoted left operand. Thus, 
the value of an expression such as c « 1 gets stored in 4 bytes. 

The right shift operator » is not symmetric to the left shift operator. For unsigned 
integral expressions, Os are shifted in at the high end. For the signed types, some 
machines shift in Os, while others shift in sign bits. (See exercise 4, on page 357.) The 
sign bit is the high-order bit; it is 0 for nonnegative integers and 1 for negative integers. 


Declarations and initializations 

int a = 1 « 31; /* shift 1 to the high bit V 

unsigned 1; 

) - 1 « 31; 


Expression 

Representation 

Action 

a 

10000000 00000000 00000000 00000000 

unshifted 

a >> 3 

11110000 00000000 00000000 00000000 


b 

10000000 00000000 00000000 00000000 

unshifted 

b » 3 


right-shifted 3 


Note that on our machine, sign bits are shifted in with an i nt. On another machine, Os 
might be shifted in. To avoid this difficulty, programmers often use unsigned types 
when using bitwise operators. ' 

If the right operand of a shift operator is negative or has a value that equals or 
exceeds the number of bits used to represent the left operand, then the behavior is 
undefined. It is the programmer’s responsibility to keep the value of the right operand 
within proper bounds. 

Our next table illustrates the rules of precedence and associativity with respect to the 
shift operators. To save space, we show only the two low-order bytes. 


Declaration and assignments 

unsigned a = 1, b = 2; 

Expression 

Equivalent expression 

Representation 

Value 

a « b >> 1 

(a « b) » 1 

00000000 00000010 

2 

a « 1 + 2 « 3 

(a « (1 + 2)) « 3 

00000000 01000000 

64 

a + b « 12 v a >> b 

((a + b) « (12 * a)) » b 

00001100 00000000 

. 

3072 


In C++, the two shift operators are overloaded and used for input/output. Overload¬ 
ing in C++ is a method of giving existing operators and functions additional meanings. 
(See Section 13.5, “Overloading,” on page 603, for examples and explanation.) 


7 -2 v Masks 


' " • - * 


Vi yy ask “ * constant or variable that is used to extract desired bits from another vari 
i able « “Piston. Because the i nt constant 1 has the bit representation 

iii$«S 

bI'KL r K.*2f>::' • 


gg 00000000 00000000 00000000 0000000! 

|t it can be used to determine the low-nrrW hit of „ 

t „ 0 „ c ... m , , , low orciei 1111 ot dn i nt expression. The following code 

|c hses ™- s mas ^ and prints an alternating sequence of Os and 1 s* 


'SfiSr' 

'• 


int i, mask = 1; 
for (i = 0; i < 10 ; ++-,*) 
printf("%d”, i & mask); 

If we wash to find the value of it nspHniinn u,, 

ic t th ., t _ ... ,' 01 a , partlcular bit m an expression, we can use a mask that 

is m that position and 0 elsewhere. For example, we can use the exnressinn i ^ i 

a mask for the third bit, counting from the right. The expression ^ 2 ’ dS 


(v & (1 « 2)) ? 1 : 0 

has the value 1 or 0 depending on the third bit in v. 

HP) Another example of a mask is the constant value 255, which is 98 
p| lowing bit representation: 1 




1. It has the fol- 


00000000 00000000 00000000 11111111 


• j; 

: . : ■ 




| Because only the low-order byte is turned on, the expression 


v & 255 


- ? - , 


Will yield a value having a bit representation with all its high-order bytes zero and it, 

low-order byte the same as the low-nrrlpr in!/, in u * ‘ e ^ 

... , , cth UIL IQ w oiaer byte m v. We express this bv saving “ocr ic .. 

mask for the low-order byte.” } y ng ’ 18 a 


Wm 


SK 


flglttS 

:S~: &.■ 


■-« -X, . . 
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73 Software Tools: Printing an int Bitwise 

Software tools are utilities that the programmer can use to write software. Most sys¬ 
tems provide a variety of software tools. Examples are compilers, debuggers, and the 
make utility. We will discuss these in Chapter 11, “Input/Output and the Operating Sys¬ 
tem.” Programmers often write other software tools for their own use as the need 
arises. The bit_print() function that we discuss in this section is a typical example. 
For anyone writing software that deals with the machine at the bit level, the 
bit_print() utility is essential; it allows the programmer to see what is happening. 
For the beginning programmer, exploration with bi t_pri nt() helps to provide a con¬ 
ceptual framework that is very useful. 

Our bi t_pri nt() function uses a mask to print out the bit representation of an i nt. 
The function can be used to explore how values of expressions are represented in mem¬ 
ory. We used it, in fact, to help create the tables in this chapter. 

In file bit_print,c 

/* Bit print an int expression. */ 

#inc1ude <limits.h> 

void bit_print(int a) 

{ 

i nt i; 

int n = sizeof(int) * CHAR_BIT; 
int mask = 1 « (n - 1); 

for (i =1; i <= n; ++i) { 

putchar(((a & mask) == 0) ? '0' '■ '1'); 
a «— 1; 

if (i % CHAR_BIT == 0 && i < n) 
putchar(' '); 

} 

} 


/* in limits.h */ 

/* mask = 100...0 */ 




m 

1 |gj 


RM 

' k 

| 1 
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Dissection of the bit_print() Function 


#include <limits.h> 


In ANSI C, the symbolic constant CHAR_BIT is defined in limits.h. In traditional C, this 
header file is not usually available. The value of CHARJ3IT on most systems is 8. It rep- 
resents the number of bits in a char, or equivalently, the number of bits in a byte. ANSI 
C requires at least 8 bits in a byte. 


i nt 


sizeof(int) * CHAR_BIT; 


/* in limits.h */ 


Because we want this function to work on machines having either 2- or 4-byte words, 
we use the variable n to represent the number of bits in a machine word. We expect the 
value of the expression si zeof (i nt) to be either 2 or 4, and we expect that the sym¬ 
bolic constant CHAR_BIT, which is defined in the standard header file limits.h, will be 8. 
Thus, we expect n to be initialized to either 16 or 32, depending on the machine. 


int mask = 1 « (n 


/* mask = 100...0 */ 


Because of operator precedence, the parentheses are not needed in the initialization. 
We put them there to make the code more readable. Because << has higher precedence 
than = , the expression 1 « (n - 1)) gets evaluated first. Suppose that n has value 
32. The constant 1 has only its low-order bit turned on. The expression 1 « 31 shifts 
that bit to the high-order end. Thus, mask has all its bits off except for its high-order bit 
(sign bit), which is on. 


for (i =1; i <= n; ++i) { 

putchar(((a & mask) == 0) ? ’0 
a «= 1; 


First consider the expression 


’I'); 


(a & mask) == 0 
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If,the high-order bit in a is off, then the expression a & mask has all its bits off, and the 
expression (a & mask) == 0 is true. Conversely, if the high-order bit in a is on, then 
the expression a & mask has its high-order bit on, and the expression (a & mask) == 0 
is false. Now consider the expression 


((a & mask) 


0 ) ? ' 0 ' : ' 1 ' 


If the high-order bit in a is off, then the conditional expression has the value ' 0'; other¬ 
wise, it has the value ' 1'. Thus, putcharf) prints a 0 if the high-order bit is off and 
prints a 1 if it is on. 


putchar(C(a & mask) == 0) ? '0 
a «= 1: 




Alter the high-order bit in a has been printed, we left-shift the bits in a by 1 and place 
the result back in a. Recall that 

a «= 1; is equivalent to a = a « 1; 

fhe value of the expression a « 1 has the same bit pattern as a, except that it has 
been left-shifted by 1. The expression by itself does not change the value of a in mem¬ 
ory. In contrast to this, the expression a «= 1 does change the value of a in memory. 

Its effect is to bring the next bit into the high-order position, ready to be printed the 
next time through the loop. 

a if (i % CHAR_BIT == 0 && i < n) 
putcharf' '); 

If we assume that the value of the symbolic constant CHAR_.BIT is 8, then this code 
causes a blank to be printed after each group of 8 bits has been printed. It is not neces¬ 
sary to do this, but it certainly makes the output easier to read. 

: Pi! 




7.4 Packing and Unpacking 

The use of bitwise expressions allows for data compression across byte boundaries. 
This is useful in saving space, but it can be even more useful in saving time. On a 
machine with 4-byte words, each instruction cycle processes 32 bits in parallel. The fol¬ 
lowing function can be used to pack four characters into an i nt. It uses shift opera¬ 
tions to do the packing byte by byte. 

In file pack_bits.c 

/* Pack 4 characters into an int. */ 

#include <limits.h> 


int pack(char a, char b, char c, char d) 

{ 

int p = a; /* p will be packed with a, b, c, d */ 

p = (p « CHAR_BIT) | b; 

p = (p « CHAR_BIT) | c; 

p = (p « CHAR_BIT) | d; 

return p; 


To test our function, we write a program with the lines 


printf("abed = "); 
bit_print(pack('a', 
putcharC'Xn'); 


'b\ 'c‘, 'd')) ; 


in main(). Here is the output of our test program: 


abed = 01100001 01100010 01100011 01100100 


Observe that the high-order byte has value 97, or 'a', and that the values of the 
remaining bytes are 98, 99, and 100. Thus, pack() did its work properly. 

Having written pack(), we now want to be able to retrieve the characters from within 
the 32-bit i nt. Again, we can use a mask to do this. 
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/* Unpack a byte from an int. */ 

#include <limits.h> 

char unpack(int p, int k) 

{ 

int n = k * CHARJ3IT; 

unsigned mask = 255; 

mask «= n; 

return ((p & mask) » n); 


/* k = 0, 1, 2, or 3 */ 

/* n = 0, 8, 16, or 24 */ 
/* low-order byte */ 



Dissection of the unpackC) Function 

m #include climits.h> 

We have included this header file because it contains the definition of the symbolic con¬ 
stant CHAR_BIT. It represents the number of bits in a byte. On most machines its value 
is 8. 


char unpack(int p, int k) 

{ 


/* k = 0, 1, 2, or 3 */ 


# « * * * 


We think of the parameter p as a packed i nt with its bytes numbered 0 through 3. The 
parameter k will indicate which byte we want: If k has value 0, then we want the low- 
order byte; if k has value 1, then we want the next byte; and so forth. 


i nt 


n = k * CHAR BIT 


/* n = 0, 8, 16, or 24 */ 


If we assume that CHAR_BIT is 8 and that k has value 0,1, 2, or 3, then n will be initial¬ 
ized with the value 0, 8, 16, or 24. 


unsigned mask = 255 


/* low-order byte */ 


The constant 255 is special; to understand it, first consider 256. Because 256 = 2 8 , the 
bit representation of 256 has all bits 0 except for a 1 in the 9th bit, counting from the 
low-order bit. Because 255 is one less than 256, the bit representation of 255 has all 
bits 0, except for the first 8 bits, which are all 1. (See exercise 8, on page 358.) Thus, the 
binary 7 representation of mask is 
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00000000 00000000 00000000 11111111 


mask «= n 


Let us assume that CHAR_BIT is 8. If n has value 0, then the bits in mask are not 
changed. If n has value 8, then the bits in mask are left-shifted by 8. In this case we 
think of mask stored in memory as 

00000000 00000000 11111111 00000000 

If n has value 16, then the bits in mask are left-shifted by 16. In this case we think of 
mask stored in memory as 

00000000 11111111 00000000 00000000 

In a similar fashion, if n has value 24, then mask will have only the bits in its high-order 
byte turned on. 

■ (p & mask) » n 

Parentheses are needed because & has lower precedence than ». Suppose that p has 
value -3579753 (which we chose because it has a suitable bit pattern), and suppose that 
n has value 16. The following table illustrates what happens: 


Expression 

Binary representation 

Value 

P 

11111111 11001001 01100000 10010111 

-3579753 

mask 

00000000 11111111 00000000 00000000 

16711680 

p & mask 

00000000 11001001 00000000 00000000 

13172736 

(p & mask) » n 

00000000 00000000 00000000 11001001 

201 


■ return ((p & mask) » n); 

ft Because the function type for unpackC) is char, the int expression (p & mask) » n 
gets converted to a char before it gets passed back to the calling environment. When an 
i nt is converted to a char, only the low-order byte is kept; other bytes are discarded. 




Imagine wanting to keep an abbreviated employee record in one integer. We will sup¬ 
pose that an “employee identification number” can be stored in 9 bits and that a “job 
type” can be stored in 6 bits, which provides for a total of up to 64 different job types. 


■ vv: :ryJI 
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The employee’s “gender” can be stored in 1 bit. These three fields will require i s, 
which, on a machine with 4-byte words, is a short integer. We can think of the t ree it 

fields as follows: 



■US 
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Identification 

bbbbbbbbb 


Job type 
bbbbbb 


Gender 
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The following function can be used in a program designed to enter employee data into 
a short. The inverse problem of reading data out of the short would be accomplishedg 

with the use of masks. 

/* Create employee data in a short int. */ 

short create_employee_data(int icLno, int job__type, char gender) 
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short employee 


/* start with all bits off ’V 


V 


employee |= (gender == ’m 1 I I gender == 'M ) .0:1; 
employee |= job_type « 1; 
employee |= id_.no « 7; 
return employee; 




■: . 


p? 



Multibyte Character Constants j 

Multibyte characters are allowed in ANSI C. An example is ' abc'. On a machine with 4- g| 
byte words, this causes the characters 'a', ' b', and ' c' to be packed into a sing 
word. However, the order in which they are packed is machine-dependent. Someg 
machines put 1 a' in the low-order byte; others put it in the high-order byte. ( ee exer,^. 

cise 12, on page 359.) ms 




lie®: 

m 



7.5 ▼ Enumeration Types 345 


7.5 Enumeration Types 

The keyword enum is used to declare enumeration types. It provides a means of naming 
a finite set, and of declaring identifiers as elements of the set. Consider, for example, 
the declaration 

enum day {sun, mon, tue, wed, thu, fri, sat}; 

This creates the user-defined type enum day. The keyword enum is followed by the tag 
name day. The enumerators are the identifiers sun, mon, ..., sat. They are constants of 
type i nt. By default, the first one is 0, and each succeeding one has the next integer 
value. This declaration is an example of a type specifier, which we also think of as a 
template. No variables of type enum day have been declared yet. To do so, we can now 
write 

enum day dl, d2; 

Tills declares dl and d2 to be of type enum day. They can take on as values only the ele¬ 
ments (enumerators) in the set. Thus, 

dl = f ri; 

assigns the value f ri to dl, and 
if (dl == d2) 

. /* do something */ 

tests whether dl is equal to d2. Note carefully that the type is enum day. The keyword 
enum by itself is not a type. 

The enumerators can be initialized. Also, we can declare variables along with the 
template, if we wish to do so. The following is an example: 

enum suit {clubs = 1, diamonds, hearts, spades} a, b, c; 

Because cl ubs has been initialized to 1, di amonds, hearts, and spades have the values 
2, 3, and 4, respectively. In this example 

enum suit {clubs = 1, diamonds, hearts, spades} 
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is the type specifier, and a, b, and c are variables of this type. Here is another example 
of initialization: 

-enum fruit {apple = 7, pear, orange = 3, lemon} frt; 

Because the enumerator apple has been initialized to 7, pear has value 8. Similarly, 
because orange has value 3, lemon has value 4. Multiple values are allowed, but the 
identifiers themselves must be unique. 

enum veg {beet = 17, carrot = 17, corn = 17} vegel, vege2; 

The tag name need not be present Consider, for example, 

enum {fir, pine} tree; 

Because there is no tag name, no other variables of type enum {fi r, pi ne} can be 
declared. 

The following is the syntax for the enumeration declaration: 

enum_declaration ::= enum_type_specifier identifier { , identifier }$+ ; 
enum_type_specifier :: = enum ejag { ejist } 

I enum ejag 
I enum { ejist } 
ejag :: = identifier 

ejist :: = enumerator { , enumerator } 0+ 

enumerator :: = identifier { = constantJntegraLexpression } opt 

In general, one should treat enumerators as programmer-specified constants and use 
them to aid program clarity. If necessary, the underlying value of an enumerator can be 
obtained by using a cast. The variables and enumerators in a function must all have dis¬ 
tinct identifiers. The tag names, however, have their own name space. This means that 
we can reuse a tag name as a variable or as an enumerator. The following is an example: 

enum veg {beet, carrot, corn} veg; 

Although this is legal, it is not considered good programming practice. 

We illustrate the use of the enumeration type by writing a function that computes the 
next day which uses typedef to replace the enum keyword in the type declaration. 
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In file next_day.c 

/* Compute the next day. */ 

enum day {sun, mon, tue, wed, thu, fri, sat}; 

typedef enum day day; /*the usual typedef trick */ 

day find_next_day(day d) 

{ 

day next__day; 

switch (d) { 
case sun: 

next_day = mon; 
break; 
case mon: 

next_day = tue; 
break; 
case tue: 

next_day = wed; 
break; 
case wed: 

next_day = thu; 
break; 
case thu: 

next„day = fri; 
break; 
case fri: 

next_day = sat; 
break; 
case sat: 

next_day = sun; 
break; 

} 

return next_day; 

} 

Recall that only a constant integral expression can be used in a case label. Because enu¬ 
merators are constants, they can be used in this context. The following is another ver¬ 
sion of this function; this version uses a cast to accomplish the same ends: 





348 Chapter 7 T Bitwise Operators and Enumeration Types 


/* Compute the next day with a cast. */ 

enum day {sun, mon, tue, wed, thu, fri, sat}; 

typedef enum day day; 

day find_next_day(day d) 

{ 

assert((int) d >= 0 && (int) d < 7) 
return ((day)(C(int) d +1) % 7)); 

} 

Enumeration types can be used in ordinary expressions provided type compatibility 
is maintained. However, if one uses them as a form of integer type and constantly 
accesses their implicit representation, it is better just to use integer variables instead. 
The importance of enumeration types is their self-documenting character, where the 
enumerators are themselves mnemonic. Furthermore, enumerators force the compiler 
to provide programmer-defined type checking so that one does not inadvertently mix 
apples and diamonds. 


7.6 An Example: The Game of Paper, Rock, Scissors 

We will illustrate some of the concepts introduced in this chapter by writing a program 
to play the traditional children’s game called “paper, rock, scissors.” In this game each 
child uses her or his hand to represent one of the three objects. A flat hand held in a 
horizontal position represents “paper,” a fist represents “rock,” and two extended fin¬ 
gers represent “scissors.” The children face each other and at the count of three display 
their choices. If the choices are the same, then the game is a tie. Otherwise, a win is 
determined by the rules: 


Paper covers the rock. 
Rock breaks the scissors. 
Scissors cut the paper. 


We will write this program in its own directory. The program will consist of a .h file 
and a number of x files. Each of the x files will include the header file at the top of the 
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file. In the header file we put #include directives, templates for our enumeration 
types, type definitions, and function prototypes: 

In file p_r_s.h 

/* The game of paper, rock, scissors. */ 

#include <ctype.h> /* for isspaceO */ 

#include <stdio.h> /* for printfQ, etc */ 

#include <std1ib.h> /* for randQ and srand() */ 

finclude <time.h> /* for timeQ */ 

enum p_r_s {paper, rock, scissors, 

game, help, instructions, quit}; 
enum outcome {win, lose, tie, error}; 

typedef enum p_r_s p_r_s; 

typedef enum outcome outcome; 

outcome compare(p_r_s player_choice, 
p_r_s machine_choice); 

void prn_final_status(int win_cnt, int lose_cnt); 

void prn_game_status(int win_cnt, 

int lose_cnt, int tie_cnt); 
void prn_help(void); 

void prn_instructions(void); 

void report_and_tabulate(outcome result, 

int *win_cnt_ptr, 
int *lose_cnt_ptr, 
int *tie_cnt_ptr); 

p_r_s selection_by_machine(void); 

p_r„s selection_by_player(void); 

We do not normally comment our #i ncl ude lines, but here we are trying to make the 
code more readable for the novice programmer. Here is our mai n () function: 
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In file main.c 

#include "p_r_s.h" 
int main(void) 

int win_cnt = 0, lose_cnt = 0, tie_cnt = 0; 

outcome result; 

p_r_s player_choice, machine_choice; 

srand(time(NULL)); /* seed the random number generator */ 

prn_instructions(); 

while_C(p1ayer_choice = selection_by_player()) != quit) 
switch (player_choice) { 
case paper: 
case rock: 
case scissors: 

machine_choice = selection_by„machine(); 
result = compare(piayer_choice, machine_choice); 
report_and_tabulate(result, &win_cnt, &lose_cnt, 

&tie_cnt); 

break; 
case game: 

prn_game_status(win_cnt, lose_cnt, tie_cnt); 
break; 

case instructions: 
prn_instructions(); 
break; 
case help: 
prn_help(); 
break; 
default: 

pri'ntf("\nPROCRAMMER ERROR: Cannot get to here!\n\n"); 
exit(I); 

} 

prn_game_status(win_cnt, lose_cnt, tie_cnt); 
prn_final_status(win_cnt, lose_cnt); 
return 0; 

} 

The first executable statement in mai n Q is 
srand(time(NULL)); 

This seeds the random-number generator rand(), causing it to produce a different 
sequence of integers each time the program is executed. More explicitly, passing 
srand() an integer value determines where rand() will start. The function call 
time (NULL) returns a count of the number of seconds that have elapsed since 1 Janu¬ 
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ary 1970 (the approximate birthday of UNIX). Both s rand () and ti me () are provided in 
the standard library. The function prototype for srandO is in stdlib.h, and the function 
prototype for ti me 0 is in time.h. Both of these header files are provided by the system. 
Note that we included them in p_r_s.h. 

The next executable statement in main() calls prn_instructions(). This provides 
instructions to the user. Embedded in the instructions are some of the design consider¬ 
ations for programming this game. We wrote this function, along with other printing 
functions, in prn.c. 

In file prn.c 

#include "p_r_s.h" 

void prn_fi'nal_ 5 tatus(int win_cnt, int lose_cnt) 

{ 

if (win_cnt > lose_cnt) 

printf("CONGRATULATIONS - You won!\n\n"); 
else if (win_cnt == lose_cnt) 

printf("A DRAW - You tied!\n\n"); 
el se 

printf("SORRY - You lost!\n\n"); 

} 

void prn_game_status(int win_cnt, int lose_cnt, int tie_cnt) 

{ 

printf("\n%s\n%s%4d\n%s%4d\n%s%4d\n%s%4d\n\n", 

"GAME STATUS:", 

" Win: ", win_cnt, 

" Lose: ”, lose_cnt, 

" Tie: ", tie_cnt, 

" Total: ", win_cnt + lose_cnt + tie_cnt); 


void prn_help(void) 

{ 

printf("\n%s\n", 

"The following characters can be used for input:\n" 
" p for paper\n" 

" r for rock\n" 

" s for scissors\n" 

" g print the game status\n" 

" h help, print this list\n" 

" i reprint the instructions\n" 

" q quit this game\n"); 
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void prn_instructions(void) 

printf("\n%s\n", 

"PAPER, ROCK, SCISSORS:\n" 

" c ?! i^ S \H ame p is f° r V'paper,\" r is for \"rock,\" and" 
s is for \ scissors,\"\n" 

„ tbe P^yer and the machine\n" 

„ choose one of p, r, or s," 

„ I> the two choices are the same,\n” 

„ tben the game is a tie. Otherwise:\n" 

n Y'paper covers the rock\" (a win for paper),\n" 

„ .\, ro( ?k breaks the scissors\" (a win for rock)An” 

„ \ scissors cut the paper\" (a win for scissors),\n' 

,, There are other allowable inputs:\n" 

„ 9 l or P 3 '" 6 status (the number of wins so far) An" 

h for help,\n" - ’ ' 

" i for instructions (reprint these instructions)An’ 

.i\ n „ 9 f° r c l u1t . (to quit the game).\n" 

"This game is played repeatedly until q is entered.\n" 

Good luck!\n"); 

r 


“irrnl? *l 8ame ' bot ^ ; hc machine and the player (user) need to make a selection from 
paper, rock, scissors.” We write these routines in selections 

In file selections 

#include "p_r_s.h" 


p_r_s selection_by_machine(void) 

{ 

return ((p_r_s) (randQ % 3)); 
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p_r_s selection_by_player(void) 

{ 

char c; 

p_r_s player_choice; 

printf("Input p, r, or s: "); 
while (isspace(c = getchar())) 


/* skip white space */ 


switch (c) { 
case 'p': 

player_choice = paper; 
break; 


^ c o * k 1 1 “ 

LCl J w I * 


pi ayer_choi ce = rock 
break; 


r a cn * c * 
Cl V* 


player_choice = scissors; 
break; 


case 1 g': 


player_choice = game 
break; 


case 'i' 


player_choice = instructions; 
break; 
case 1 q': 

player_choice = quit; 
break; 
default: 

piayer_choice = help; 
break; 

} 

return player_choice; 


The machine’s selection is computed by the uses of the expression rand () % 3 to pro¬ 
duce a randomly distributed integer between 0 and 2. Because the type of the function 
is p_s_r, the value returned will be convened to this type, if necessary. We provided an 
explicit cast to make the code more self-documenting. 

Note that in sel ecti on_by_pl aye r() we use the macro isspaceO from ctype.h to 
skip white space. (See Section 8.7, “The Macros in stdio.h and ctype.h,” on page 382.) 
After white space is skipped, all other characters input at the terminal are processed, 
most of them through the default case of the swi tch statement. 

The value returned by sel ecti on_by_pl ayer() determines which case gets exe¬ 
cuted in the switch statement in main(). The value returned depends on what the 
player types. If the character g is input, then prn_game_status() is invoked; if any 
character other than white space or p, r, s, g, i, or q is input, then prn_help() is 
invoked. 
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Once the player and the machine have made a selection, we need to compare the two 
selections in order to determine the outcome of the game. The following function does 
this: 

In file compare.c 

#include "p_r_s.h" 

outcome compare(p_r_s player_choice, p_r_s machine_choice) 

{ 

outcome result; 

if (player_choice == machine_choice) 
return tie; 

switch (player_choice) { 
case paper; 

result = (machine_choice == rock) ? win : lose; 
break; 
case rock: 

result = (machine_choice == scissors) ? win : lose; 
break; 

case scissors: 

result = (machine_choice == paper) ? win : lose; 
break; 
default: 

printf("\nPROGRAMMER ERROR: Unexpected choice!\n\n"); 
exit(l); 

} 

return result; 

} 

The value returned by the call to compare() in mai n() gets passed to the function 
report_and_tabulate(). This function reports to the user the result of a round of 
play and increments as appropriate the number of wins, losses, and ties. 



s', s ■ s 
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In file report.c 

#include "p_r_s.h" 

void report_and_tabulate(outcome result, 

int *win_cnt_ptr, int *lose_cnt_ptr, int *tie_cnt_ptr) 

{ 

switch (result) { 
case win: 

++*win_cnt_ptr; 
printf("%27sYou win.\n", ""); 
break; 
case lose: 

++*1ose_cnt_ptr; 

printf("%27sYou lose.\n", ""); 

break; 

ra <;p fi p ■ 

++*tie_cnt_ptr; 
printf("%27sA tie.\n", ""); 
break; 
default: 

printf("\nPROGRAMMER ERROR: Unexpected result!\n\n"); 
exit(l); 

} 


We are now ; ready to compile our function. We can do this with the command 

cc -o p_r_s ma'm.c compare.c prnx reportx selectionx 

Later, after we have learned about the make utility, we can facilitate program develop¬ 
ment by using an appropriate makefile. (See Section 11.17, “The Use of make,” on page 
532.) 
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Summary 

1 The bitwise operators provide the programmer with a means of accessing the bits 
in an integral expression. Typically, we think of the operands of these operators as 
bit strings. 

2 The use of bitwise expressions allows for data compression across byte boundaries. 
This capability is useful in saving space, but is more useful in saving time. On a 
machine with 4-byte words, each instruction cycle processes 32 bits in parallel. 

3 Most machines use the two's complement representation for integers. In tins repre¬ 
sentation, the high-order bit is the sign bit. It is 1 for negative integers and 0 for 
nonnegative integers. 

4 Bitwise operations are explicitly machine-dependent. A left shift causes Os to be 
shifted in. The situation for a right shift is more complicated. If the integral expres¬ 
sion is unsi gned, then Os are shifted in. If the expression is one of the signed types, 
then what gets shifted in is machine-dependent. Some machines shift in sign bits. 
This means that if the sign bit is 0, then Os are shifted in, and if the sign bit is 1, 
then Is are shifted in. Some machines shift in Os in all cases. 

5 Masks are particular values used typically with the & operator to extract a given 
series of bits. Packing is the act of placing a number of distinct values into various 
subfields of a given variable. Unpacking extracts these values. 

6 The keyword enum allows the programmer to define enumeration types. A variable 
of such a type can take values from the set of enumerators associated with the type. 

7 Enumerators are distinct identifiers chosen for their mnemonic significance. Their 
use provides a type-checking constraint for the programmer, as well as self-docu¬ 
mentation for the program. 

8 Enumerators are constants of type i nt. Thus, they can be used in case labels in a 
swi tch. A cast can be used to resolve type conflicts. 
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1 Suppose that integers have a 16-bit two’s complement representation. Write the 
binary representation for -1, -5, -101, -1023. Recall that the two’s complement rep¬ 
resentation of negative integers is obtained by taking the bit representation of the 
corresponding positive integer, complementing it, and adding 1. 

2 Alice, Betty, and Carole all vote on 16 separate referenda. Assume that each individ¬ 
ual’s vote is stored bitwise in a 16-bit integer. Write a function definition that 
begins 

short majority(short a, short b, short c) 

{ 

***** 

This function should take as input the votes of Alice, Betty, and Carole stored in a, 
b, and c, respectively. It should return the bitwise majority of a, b, and c. 

3 Write a function definition that begins 

int circular_shift(int a, int n) 

{ 

***** 

This function should left-shift a by n positions, where the high-order bits are rein¬ 
troduced as the low-order bits. Here are two examples of a circular shift operation 
defined for a char instead of an i nt: 

10000001 circular shift 1 yields 00000011 
01101011 circular shift 3 yields 01011011 

4 Does your machine shift in sign bits? Here is some code that will help you deter¬ 
mine this. Explain why this code works. 

int i = -1; /* turn all bits on */ 

unsigned u = -1; 

if (i » 1 == u » 1) 

printf("Zeros are shifted in.\n"); 
el se 

printf("Sign bits are shifted in.\n"); 
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5 Write a function that will reverse the bit representation of an i nt. Here are two 
examples of a reversing operation defined for a char instead of an i nt: 

01110101 reversed yields 10101110 

10101111 reversed yields 11110101 

6 Write a function that will extract every other bit position from a 32-bit expression. 
The result should be returned as a 16-bit expression. Your function should work on 
machines having either 2- or 4-byte words. 

7 Write a function that takes as its input a string of decimal integers. Each character 
in the string can be thought of as a decimal digit. The digits should be converted to 
4-bit binary strings and packed into an i nt. If an i nt has 32 bits, then eight digits 
can be packed into it. When you test your function, here is wliat you might see on 
the screen: 

Input a string of decimal digits: 12345678 
12345678 = 0001 0010 0011 0100 0101 0110 0111 1000 

Also, write an inverse function. It should unpack an i nt and return the original 
string. Hint: Here is one way to begin a conversion function: 

int convert(char *s) 

{ 

char *p; 

int a = 0; /* turn all bits off */ 

for (p = s; *p != '\0'; ++p) { 
a «= 4; 
switch (*p) { 
case '1': 
a |= 1; 
break; 

case '2': 

* « ■ * • 

8 Use the bit_pri nt() function to create a table containing n, the binary representa¬ 
tion for 2 n, and the binary representation for 2 n , for n = 0, 1, 2.32. If your 

machine has 2-byte words, then the output of your program should look like this: 

0: 00000000 00000001 00000000 00000000 

1 : 00000000 00000010 00000000 00000001 

2: 00000000 00000100 00000000 00000011 

■till 

15: 10000000 00000000 01111111 11111111 
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After you have done this, write down a similar table by hand that contains n, 10" 
and 10” - 1 for n = 0, 1, 2,..., 7. Write the numbers in base 10 in your table. Do you 
see the similarity between the two tables? Hint: Use the following code: 

int i, power = 1; 

for (i =0; i <32; ++i) { 

. printf("%2d: ", i); 
bit_print(power); 
printf(" "); 
bit_print (power - 1); 
putchar('\n'); 
power *= 2; 

} 

9 Some of the binary representations in the tables in this chapter are easy to check 
for correctness, and some are not. Use bi t_pri nt() to check some of the more dif¬ 
ficult representations. 

10 Write a version of the bi t_pri nt() function that will work on machines with either 
2- or 4-byte words. Hint: Use the si zeof operator to find the number of bytes in an 

i nt. 

11 If you are not familiar with the use of the constants 0xff, 0xff00, 0xff0000, and 
0xff000000 as masks, write a test program that uses bi t_pri nt() to print these 
values as bit strings. 

12 If your machine has 4-byte words, use the function bi t_pri nt() to find out how 
the multibyte character 'abc' is stored on your machine. If your machine has 2- 
byte words, then you can put only two characters into a multibyte character. In that 
case, try 'ab'. 

13 Write a roulette program. The roulette (machine) will select a number between 0 
and 35 at random. The player can place an odd/even bet, or can place a bet on a 
particular number. A winning odd/even bet is paid off at 2 to 1, except that all odd/ 
even bets lose if the roulette selects 0. If the player places a bet on a particular 
number and the roulette selects it, then the player is paid off at 35 to 1. If you play 
this game, how many one-dollar bets can you place before you lose ten dollars? 










360 Chapter 7 ▼ Bitwise Operators and Enumeration Types 


14 Write a function called previous_month() that returns the previous month. Start 
with the code 

enum month {jan, feb, ... , dec}; 

typedef enum month month; 

If jan is passed as an argument to the function, then dec should be returned. Write 
another function that prints the name of a month. More explicitly, if the enumera¬ 
tor jan is passed as an argument, then January should be printed. Write main() so 
that it calls your functions and produces a table of all twelve months, each one 
listed next to its predecessor month. Caution: When printf () is used, a variable 
of an enumeration type is printed as its implicit integer value. That is, 

printf("%d\n", jan); 
prints 0, not jan. 

15 Write a next-day program for a particular year. The program should take as input 
two integers, say 17 and 5, which represents 17 May, and it should print as output 
18 May, which is the next day. Use enumeration types in the program. Pay particu¬ 
lar attention to the problem of crossing from one month to the next. 

16 A twentieth-century date can be written with integers in the form day/month/year. 
An example is 1/7/33, which represents 1 July 1933. Write a function that stores 
the day, month, and year compactly. Because we need 31 different values for the 
day, 12 different values for the month, and 100 different values for the year, we can 
use 5 bits to represent the day, 4 bits to represent the month, and 7 bits to repre¬ 
sent the year. Your function should take as input the day, month, and year as inte¬ 
gers, and it should return the date packed into a 16-bit integer. Write another 
function that does the unpacking. Write a program to test your functions, 

17 Write a function that acts directly on a packed date, and produces the next calendar 
day in packed form. (See the previous exercise.) Contrast this to the program you 
wrote in the previous exercise. 

18 Rewrite the program given in Section 4.10, “An Example: Boolean Variables,” on 
page 170. Use the five low-order bits in the char variable b to represent the five 
boolean variables bl,..., b5. 
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19 Rewrite the program from the previous exercise to take advantage of machine arith¬ 
metic. Show by hand simulation that the effect of adding 1 to the bit representation 
for b is equivalent to the effect of the nested for statements. In this exercise, your 
program should generate the table using a single unnested for statement. 

20 (Balanced Meal Program) Use enumeration types to define five basic food groups: 
fish, fruits, grains, meats, and vegetables. Use a random number generator to select 
an item from each food group. Write a function meal () that picks an item from 
each of the five groups and prints out this menu. Print 20 menus. How many differ¬ 
ent menus are available? 

21 Write a function that picks out five cards at random from a deck of cards. Your 

function should check that all the cards in the hand are distinct. Recall that the 
spots on a playing card that represent its numeric value are called “pips.” A playing 
card such as the seven of hearts has a pip value 7 and a suit value hearts. The pip 
value for an ace is 1, a deuce is 2.and a king is 13. Use enumeration types to rep¬ 

resent the pips and suit values in your function. Write another function that prints 
out the hand in a visually pleasing way. 

22 Write a set of routines that test whether the hand generated by the function in the 
previous exercise is a straight, a flush, or a full house. A straight consists of five 
cards that can be placed in consecutive sequence by pip value. A flush consists of 
five cards of the same suit. A full house is three of a kind plus a pair. Run your ran¬ 
dom hand generat or and print out any hand that is one of these three kinds, along 
with the hand number. Continue to print out hands until one of each of the three 
kinds has been generated, or until you have generated 5,000 hands. If the latter 
happens, there is probably something wrong with your program. Do you know why? 

23 In the game “paper, rock, scissors,” an outcome that is not a tie is conveyed to the 
player by printing 

You win. or You lose. 

Rewrite the program so that messages like the following are printed: 

You chose paper and I chose rock. You win. 

24 Consider the function pack() given in Section 7.4, “Packing and Unpacking,” on 
page 341. The body of the function consists of four statements. Rewrite the func¬ 
tion so that these four statements are collapsed into a single return statement. 


25 Rewrite the function packQ so that only arithmetic operations are used. 
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26 On any machine, a mask of type 1 ong is acceptable. However, when we tried the fol¬ 
lowing initialization on a 2-byte machine, our code did not work as expected: 

long mask = 1 « 31; /* turn the high bit on: error! */ 

We made an egregious error. Can you explain what it is? 

27 How multicharacter character constants such as 'abc' get stored is system-denen- 
dent. Because programmers sometimes write 'abc' instead of "abc", some comb¬ 
ers provide a warning when multicharacter character constants get used, even if the 
use is proper. What happens on your system? Try the following code: 


i nt 


'abc'; 


printfC"'abc' = "); 
bit_print(c); 
printf("\n"); 

Here is the output on a Sun workstation: 

'abc' = 00000000 011000 H 01100010 01100001 

28 A useful implementation of the mathematical concept of set is an unsigned long 
treated as a set of up to 32 elements. 

typedef unsigned long set; 

const set empty = 0X0; /* use hexadecimal constants */ 

Write a routine that does set union using bit operators. Caution: Because union is a 
keyword, use another name. 

/" This function returns the union of a and b. */ 
set Union(set a, set b); 

By using masks you can examine whether a bit position is 1. Use this idea to write a 
function 


void display(set a); 
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that informatively prints out the members of a set. To test your functions, you 
could write 


set a = 0X7; 

set b = 0X55; 

display(Union(a, b)) 


/* a has elements 1, 2, 3 */ 

/* b has elements 1, 3, 5, 7 */ 

/* 1, 2, 3, 5, 7 is in the union */ 


(Project) Use the ideas presented in the previous exercise to develop a complete 
set manipulation package for sets whose size is 32 members or less. Thus, you need 
to write 

set Union(set a, set b); 

set intersection(set a, set b); 

set complement(set a); 

After you have written these functions and tested them, use arrays of i nts to repre¬ 
sent larger sets. The size of the arrays should allow for sets with 1,000 members or 
less. Modify your functions to work with these sets. 
















The C language uses the preprocessor to extend its power and notation. In this chapter, 
we present a detailed discussion of the preprocessor, including new features added by 
the ANSI C committee. We begin by explaining the use of #i nclude. Then we thor¬ 
oughly discuss the use of the #define macro facility. Macros can be used to generate 
inline code that takes the place of a function call. Their use can reduce program execu¬ 
tion time. 

Lines that begin with a # are called preprocessing directives. These lines communicate 
with the preprocessor. In ANSI C, the # can be preceded on the line by white space, 
whereas in traditional C, it must occur in column 1. The syntax for preprocessing direc¬ 
tives is independent of the rest of the C language. The effect of a preprocessing direc¬ 
tive starts at its place in a file and continues until the end of that file, or until its effect 
is negated by another directive. It is always helpful to keep in mind that the preproces¬ 
sor does not “know C.” 


8.1 The Use of #i ncl ude 

We have already used preprocessing directives such as 

#include <stdio.h> 

#include <stdlib.h> 

Another form of the #i ncl ude facility is given by 


#i ncl ude "filename" 
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This causes the preprocessor to replace the line with a copy of the contents of the 
named file. A search for the file is made first in the current directory and then in other 
system-dependent places. With a preprocessing directive of the form 

#i ncl ude <filename> 

the preprocessor looks lor the file only in the other places and not in the current direc¬ 
tory. In UNIX systems, the standard header files such as stdio.h and stdlib.h are typically 

found in /usr/include. In general, where the standard header files are stored is system- 
dependent. 

There is no restriction on what a #i ncl ude file can contain. In particular, it can con¬ 
tain other preprocessing directives that will be expanded by the preprocessor in turn. 


8.2 The Use of #define 

Preprocessing directives with #def i ne occur in two forms: 

#defi ne identifier token String 00t 

#define identifier ( identifier . identifier ) token.string opt 

The token_slring is optional. A long definition of either form can be continued to the 

next line by placing a backslash \ at the end of the current line. If a simple #defi ne of 

the first form occurs in a file, the preprocessor replaces every occurrence of identifier 

by token_string in the remainder of the file, except in quoted strings. Consider the 
example 

fdefine SECONDS_PER_DAY (60 * 60 * 24) 

In this example, the token string is (60 * 60 * 24), and the preprocessor will replace 

every occurrence of the symbolic constant SECONDS_PER_DAY by that string in the 
remainder of the file. 

The use of simple #defines can improve program clarity and portability. For exam¬ 
ple, if special constants such as n or the speed of light C cTTG used in a program, they 
should be defined. 


#define PI 3.14159 

#define C 299792.458 


/* speed of light in km/sec */ 
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Other special constants are also best coded as symbolic constants. 

#define EOF (-1) /* typical end-of-file value */ 

#define MAXINT 2147483647 /* largest 4-byte integer */ 

Program limits that are programmer decisions can also be specified symbolically. 

#define ITERS 50 /* number of iterations */ 

#define SIZE 250 /* array size */ 

fdefine EPS 1.0e-9 /* a numerical limit */ 

In general, symbolic constants aid documentation by replacing what might otherwise be 
a mysterious constant with a mnemonic identifier. They aid portability by allowing con¬ 
stants that may be system-dependent to be altered once. They aid reliability by restrict¬ 
ing to one place the check on the actual representation of the constant. 

Syntactic Sugar 

It is possible to alter the syntax of C toward some user preference. A frequent program¬ 
ming error is to use the token = in place of the token == in logical expressions. A pro¬ 
grammer could use 

#define EQ 

to defend against such a mistake. This superficial alteration of the programming syntax 
is called syntactic sugar. Another example of this is to change the form of the while 
statement by introducing “do,” which is an ALGOL style construction. 

#define do /* blank */ 

With these two #def i ne lines at the top of the file, the code 

while (i EQ 1) do { 

***** 

will become, after the preprocessor pass, 

while (i == 1 ) { 

***** 

Keep in mind that because do will disappear from anywhere in the file, the do-whi 1 e 
statement cannot be used. Using macros to provide a personal syntax is controversial. 
The advantage of avoiding == mistakes is offset by the use of an idiosyncratic style. 
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-—-—-—--—- 

83 Macros with Arguments 

So far, wc have considered only simple #defi ne preprocessing directives. We now want 
to discuss how we can use the #def i ne facility to write macro definitions with parame¬ 
ters. The general form is given by 

#define identifier ( identifier, ... , identifier') token_string opt 

There can be no space between the first identifier and the left parenthesis. Zero or 
more identifiers can occur in the parameter list. An example of a macro definition with 
a parameter is 

#define SQ(x) ((x) * (x)) 

The identifier x in the #defi ne is a parameter that is substituted for in later text. The 
substitution is one of string replacement without consideration of syntactic correct¬ 
ness. For example, with the argument 7 + w the macro call 

SQ(7 + w) expands to ((7 + w) * (7 + w)) 

In a similar fashion 

SQ(SQ(*p)) expands to ((((*p) * (*p))) * (C(*p) * (*p)))) 

This seemingly extravagant use of parentheses is to protect against the macro expand¬ 
ing an expression so that it led to an unanticipated order of evaluation. It is important 
to understand why all the parentheses are necessary. First, suppose we had defined the 
macro as 

#define SQ(x) x * x 
With this definition 

SQ(a + b) expands to a + b * a + b 


which, because of operator precedence, is not the same as 
((a + b) * (a + b)) 
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Now suppose we had defined the macro as 
#define SQ(x) (x) * (x) 

With this definition 

4 / SQ(2) expands to 4 / (2) * (2) 
which, because of operator precedence, is not the same as 

4 / ((2) * (2)) 

Finally, let us suppose that we had defined the macro as 
#define SQ (x) ((x) * (x)) 

With this definition 

SQ(7) expands to (x) ((x) * (x)) (7) 

which is not even close to what was intended. If, in the macro definition, there is a 
space between the macro name and the left parenthesis that follows, then the rest of 
the line is taken as replacement text. 

A common programming error is to end a #define line with a semicolon, making it 
part of the replacement string when it is not wanted. As an example of this, consider 

#define SQ(x) ((x) * (x)); /* error */ 

The semicolon here was typed by mistake, one that is easily made because program¬ 
mers often end a line of code with a semicolon. When used in the body of a function, 
the line 

x = SQ(y); gets expanded to x = ((y) * (y));; 

The last semicolon creates an unwanted null statement. If we were to write 

if (x == 2) 
x = SQ(y); 
el se 
++x; 

we would get a syntax error caused by the unwanted null statement. The extra semico¬ 
lon does not allow the else to be attached to the if statement. 
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Macros are frequently used to replace function calls by inline code, which is more 
efficient. For example, instead of writing a function to find the minimum of two values, 
a programmer could write 

i 

#define min(x, y) (((x) < (y)) ? (x) : (y)) 

After this definition, an expression such as 

m = min(u, v) 

gets expanded by the preprocessor to 

m = C((u) < (v)) ? (u) : (v)) 

The arguments of mi n() can be arbitrary expressions of compatible type. Also, we can 
use mi n() to build another macro. For example, if we need to find the minimum of tour 

values, we can write 

#define min4(a, b, c, d) min(min(a, b), min(c, d)) 

A macro definition can use both functions and macros in its body. For example 

#define SQ(x) ((x) * (x)) 

#define CUBE(x) (SQ(x) * (x)) . . .. 

#define F_P0W(x) sqrt(sqrt(CUBE(x))) /* fractional power:3/4 */ 

A preprocessing directive of the form 

#undef identifier 

will undefine a macro. It causes the previous definition ot a macro to be forgotten. 

Caution: Debugging code that contains macros with arguments can be difficult. To 
see the output from the preprocessor, you can give the command 

cc -E file.c 

After the preprocessor has done its work, no further compilation takes place. (See exer¬ 
cise 1, on page 396.) 
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8.4 The Type Definitions and Macros in stddef.h 

C provides the typedef facility so that an identifier can be associated with a specific 
type. A simple example is 

typedef char uppercase; 

This makes uppercase a type that is synonymous with char, and it can be used in dec¬ 
larations, just as other types are used. An example is 

uppercase c, u[100]; 

The typedef facility allows the programmer to use type names that are appropriate for 
a specific application. (See Section 9.1, “Structures,” on page 408, and Section 10.6, “An 
Example: Polish Notation and Stack Evaluation,” on page 468.) 

In this section, we are concerned with the implementation-specific type definitions 
and macros that are given in the header file stddef.h. They can occur in other standard 
header files as well. Here is how the type definitions might appear: 

typedef int ptrdiff_t; /* pointer difference type */ 

typedef short wchar„t; /* wide character type */ 

typedef unsigned size_t; /* the sizeof type */ 

The type ptrdiff_t tells what type is obtained with an expression involving the differ¬ 
ence of two pointers. (See exercise 7, on page 398.) The type wchar_t is provided to 
support languages with character sets that will not fit into a char. Some C compilers 
are not interested in providing such support. If that is the case, the type definition 
would probably be 

typedef char wchar_t; /* same as a plain char */ 

Recall that the sizeof operator is used to find the size of a type or an expression. 
For example, si zeof (doubl e) on most systems is 8. The type si ze_t is the type of the 
result of the sizeof operator. This is system-dependent, but it must be an unsigned 
integral type. We will see the use of size_t in our discussion of qsortO in the next 
section. 

The macro NULL is also given in stddef.h. It is an implementation-defined null 
pointer constant. Typically, NULL is defined to be 0, but on some systems it is given by 


#define NULL ((void *) 0) 
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8.5 An Example: Sorting with qsort() 

Programmers, for a variety of reasons, need to be able to sort data. If the amount of 
data is small, or the program does not have to run fast, we can use a bubble sort or a 
transposition sort to accomplish the task. If, however, there is a lot of data, and speed 
of execution is a concern, then we can use the function qsort() provided by the stan¬ 
dard library. (The name qsort stands for “quick sort.”) 

The function qsort () is useful because it is a lot faster than a bubble sort or a sim¬ 
ple transposition sort and it is quite easy to use. Another quicksort implementation, 
qui cksortO, however, can be even faster, but it requires more coding effort. (See exer¬ 
cises 33 and 34 on page 406. See also Section 8.15, “An Example: Quicksort,” on page 
391.) 

The function prototype for qsort C) is in stdlib.h. It is equivalent to 

void qsort(void *array, size_t n_els, size_t el„size, 

int compare(const void *, const void *)); 

Notice that the type size_t is used. Many other functions in the standard library also 
use this type. When qsort () is called, its first argument is the array to be sorted, its 
second argument is the number of elements in the array, its third argument is the num¬ 
ber of bytes in an element, and its fourth argument is a function, called the comparison 
function , which is used to compare elements in the array. In this function prototype for 
qsort (), the declaration of the fourth parameter is 

int compare(const void *, const void *) 

This is itself a function prototype, the prototype for the comparison function. The com¬ 
parison function takes as arguments two pointers to void. When compare () gets 
invoked, these two pointers will point to elements of the array. The comparison func¬ 
tion returns an i nt that is less than, equal to, or greater than zero, depending on 
whether its first argument is considered to be less than, equal to, or greater than its 
second argument. The two pointers are of type void * because they are meant to be 
generic. As we will see, qsort () can be used to sort arrays of any type. The type quali¬ 
fier const tells the compiler that the objects pointed to by the two pointers should not 
be modified. (See Section 6.19, “The Type Qualifiers const and volatile,” on page 
307.) 

Let us write a test program that illustrates the use of qsort (). In our program, we fill 
an array, print it, sort it with qsort (), and then print it again. 
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In file try„qsort.c 

#include <stdio.h> 

#include <stdlib.h> 

#include <time.h> 

#define N 11 /* size of the array */ 

enum when {before, after}; 

typedef enum when when; 

int cmp(const void *vp, const void *vq) ; 

void fill_array(double *a, int n); 

void prn_array(when val, double *a, int n); 

int main(void) 

{ 

double a[N]; 

< 

fiTI_array(a, N); 
prn_array(before, a, N); 
qsort(a, N, sizeof(double), cmp); 
prn_array(after, a, N); 
return 0; 

} 

int cmp(const void *vp, const void *vq) 

{ 

const double *p = vp; 

const double *q = vq; 

double diff = *p - *q; 

return ((diff >= 0.0) ? ((diff > 0.0) ? -1 : 0) : +1); 

} 


void fill_array(double *a, int n) 

{ 

int i; 

srand(time(NULL)); /* seed rand() */ 

for (i =0; i < n; ++i) 

a[i] = (rand() % 1001) / 10.0; 


} 
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void prn_array(when val, double *a, int n) 

{ 

int i; 

pri ntf ( n %s\n%s%s\n H , 

I! It 

((val == before) ? "Before " ; "After "), "sorting:"); 
for (i = 0; i < n; ++i) { 

if (i % 6 == 0) putcharC'\n'); 
printf("%10.If", a[i]); 

} 

putchar('\n'); 

> 

We want to discuss a number of points about this test program, but before we do, let 
us look at some typical output: 

Before sorting: 


1.5 

17.0 

99.5 

45.3 

52.6 

66.3 

3.4 

70.2 

23.4 

57.4 

6.4 


After sorting: 
99.5 

70.2 

66.3 

•*K jL 

57.4 

52.6 

45.3 

23.4 

17.0 

6.4 

3.4 

1.5 



1 1 1,11 1 " i 



aspaaipBB 


Dissection of the try_qsort Program 

B #define N 11 

/* size 

of the 

array */ 


To test the function qsortQ, we will create an array of size N. After we get the code to 
work for a small array, we can test it on a larger array. 

■ int crtip(const void *vp, const void *vq); 
void fill_array(double *a, int n); 
void prn_array(when val, double *a, int n); 


These are function prototypes. We can name our comparison function whatever we 
want, but the type of the function and the number and type of its arguments must 
agree with the last parameter in the function prototype for qsortQ. 
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int main(void) 

{ 

double a[N]; 

fi 1 l_array(a, N); 
prn_array(before, a, N); 
qsort(a, N, sizeof(double), cmp); 
prn_array(after, a, N); 
return 0; 


In mai n(), we declare a to be an array of doubl es. Because the purpose of our program 
is to test qsortO, we do not do anything exciting. All we do is fill the array, print it, 
use qsortQ to sort it, and then print the array again. 

■ qsort(a, N, sizeof(double), cmp); 

When qsortQ is invoked, we must pass it the base address of the array to be sorted, 
the number of elements in the array, the number of bytes required to store an element, 
and the name of our comparison function. 

■ int cmp(const void *vp, const void *vq) 

{ 

»!**•* 

This is the start of the function definition for our comparison function. The letter v in 
vp and vq is mnemonic for “void.” In the body of mai n() we pass the name of our com¬ 
parison function as the last argument to qsortQ. This occurs in the statement 

qsort(a, N, sizeof(double), cmp); 

In the body of qsortQ, which the programmer does not have access to, pointers to ele¬ 
ments of the array a will be passed to cmpQ. The programmer is not concerned with 
the internal details of qsortQ. The programmer only has to write the comparison 
function with the understanding that the parameters vp and vq are pointers to ele¬ 
ments of the array. 
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int cmp(const void *vp, const void *vq) 

{ 

const double *p = vp; 

const double *q = vq; 

double diff = *p - *q; 


srand(time(NULL)); 


/* seed rand() */ 


fglll 


In the body of our comparison function, we initialize vp to p and vq to q. If we do not 
qualify p and q with const, the compiler will complain. (See Section 6.19, “The Type 
Qualifiers const and volatile,” on page 307.) The variables p and q are of type 
pointer to doubl e because the elements of the array a are of type doubl e. We initialize 
di ff to the difference of the objects pointed to by p and q. 

■ return ((diff >= 0.0) ? ((diff > 0.0) ? -1 : 0) : +1); 

If di ff is positive, we return -1; if di ff is zero, we return 0; and if di ff is negative, we 
return 1. Tins causes the array to be sorted in descending order. We can cause the array 
to be sorted in ascending order if we replace this line with 

if (diff < 0.0) 
return -1; 
if (diff — 0.0) 
return 0; 
return 1; 

■ void fi ll_array(double *a, int n) 

{ 

int i; 


for (i 
a[i] 


0; i < n; ++i) 

= (randQ % 1001) / 10.0; 


r ’~-: - 




If" W;'-' m 

fcv « m&sa 




— mm 


Typically, the function call time (NULL) returns the number of seconds that have 
elapsed since 1 January 1970. ANSI C does not guarantee this, but this convention is 
widely followed. Passing time (NULL) to srandO causes the array a to be filled with 
different values every time the program is invoked. 
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8.6 An Example: Macros with Arguments 


’ in this section, we again fill arrays and sort them with qsortQ, but this time we use 
macros with arguments. We will call our program sort. 

J Let us write our program in three files: a header file sort.h and two ,c files. In the 
header file we put our #i ncl udes, our #def i nes, and a list of function prototypes. 

In file sort.h 


#include <stdio.h> 
#inelude <stdlib.h> 
#include <string.h> 
#include <time.h> 


#define 
#define 

ft M I 1 I 1 v» 

#define 
#Hpfinp 


fractional_part(x) 
random_char() 
random_float() 


/* size of a[] */ 
/* size of b[] */ 

(x - (int) x) 

(randQ % 26 + 'a') 

(randQ % 100 / 10.0) 


#define FILL(array, sz, type) \ 

if (strcmp(type, "char") == 0) \ 

for (i =0; i < sz; ++i) \ 

array[i] = random_char(); \ 

pi cp \ 

for (i =0; i < sz; ++i) \ 

array[i] = random_float() 

#define PRINT(array, sz, cntrl_string) \ 
for (i =0; i < sz; ++i) \ 

printf(cntr1_string, array[i]); \ 

putchar('\n') 

int compare_fractional_part(const void *, const void *); 
int lexico(const void *, const void *); 


The expression randQ % 1001 has an int value in the interval 0 to 1000. Because we 
are dividing this by the double value 10.0, the value of what is assigned to a[i ] is in 
the interval 0 to 100. 
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Dissection of the sort.h Header File 

■ #iinclude <stdio.h> 

#include <stdlib.h> 

#include <string.h> 

#include <time.h> 

The header file stdio.h contains the macro definition for NULL and the function proto¬ 
type for printfC). The header file stdlib.h contains the function prototypes for 
randO, srandC), and qsortQ. The header file time.h contains the function prototype 
for timeC). The function call time (NULL) will be used to seed the random number gen¬ 
erator. 


■ #define fractional_part(x) (x - (int) x) 

If x is a positive f 1 oat, then the expression x - (i nt) x yields the fractional part of x. 

■ #define random_char() (rand() % 26 + 'a') 

When rand() is invoked, it returns an integer value randomly distributed between 0 
and MAX_RAND, a symbolic constant defined in stdlib.h. Because MAX_RAND is typically 
more than 32 thousand, the expression rand() % 26 yields an integer value randomly 
distributed between 0 and 25. Because 0 + ' a' has the value ' a' and 25 + ' a' has the 
value ' z ’, the expression 

rand() % 26 + 'a' 

produces a character value randomly distributed between ’ a' and ' z 1 . 

■ #define random_float() (randO % 100 / 10.0) 

The value of the expression randO % 100 is an integer randomly distributed between 
0 and 99. Because the expression 10.0 is of type double, the value produced by rand() 
% 100 is promoted to a doubl e, and the expression 

rand() % 100 / 10.0 

as a whole is also of type doubl e. Its value is between 0 and 9.9. 



'Ll 

M. 


8.6 ▼ 'An Example: Macros with Arguments 379 


■ #define FILL(array, sz, type) \ 

if (strcmp(type, "char") == 0) \ 

for (i =0; i < sz; ++i) \ 

array[i] = random_char(); \ 

else \ 

for (i =0; i < sz; ++i) \ 

array[i] = random_float() 

In this macro definition, array, sz, and type are parameters. Unlike function defini¬ 
tions, no type checking gets done. It is the programmer’s responsibility to call the 
macro with arguments of the appropriate type. Note that the variable i is used in the 
body of the macro. Because it is not declared here, it has to be declared in main(), 
where the macro gets called. Consider the macro call 

FILL(a, n, "char"); 

When the macro gets expanded, we obtain 

if (strcmpC'char", "char") == 0) 
for (i =0; i < n; ++i) 
a[i] = random_char(); 

el se 

for (i =0; i < n; ++i) 
a[i] = random_float(); 

The identifiers array, sz, and type have been replaced by a, n, and "char", respec¬ 
tively. Note carefully that all but the last semicolon came from the preprocessor expan¬ 
sion mechanism. 

■ #define PRINT(array, sz, cntrl_string) \ 

for (i =0; i < sz; ++i) \ 

printf(cntrl_string, array[i]); \ 
putchar('\n') 

This macro can be used to print the values of elements of an array. Note that the con¬ 
trol string for pri ntf () is a parameter in the macro definition. 

m int compare_fractional_part(const void *, const void *); 

int lexico(const void *, const void *); 

These are prototypes of comparison functions that will be passed to qsortQ, Notice 
that with respect to type, they match the function prototype of the comparison func¬ 
tion in the function prototype for qsort (). 
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tvrint^if 1 US f.f lde i the rest of thc code for our P ro P'am. In mai n (), we fill an array, 
p nt it sort it, and print it again. Then we repeat the process, but this time with an 
array of a different type, 


In file main.c 


#include "sort.h" 

int main(void) 

{ 

char a[M]; 
float b[N]; 
i nt i ; 


srand(timeCNLILL)); 

FILL(a, M , "char"); 

PRINT(a, M, "%-2c"); 

S^Ca, M ’ si2e of(char), lexico); 

PRINT(a, M, "%-2c"); 

printfC"—\n"); 

FILL(b, N, "float"); 

PRINT(b, N, "%-6.If"); 

PRINTCb,’ N,’ "%-6°lf")° at) ’ com P are - fractional -Part); 
return 0; 


Notice that each time we invoke qsort(), we use a different 
is the output of our program: 


comparison function. Here 


qmzrhl ajoetbkwl tz 

aji b b e e f h h i j j k 1 1 m m 

9-4 0.2 5,1 6.7 5.4 5 3 

6.1 5.1 0.2 5.3 5.4 9.4 


tvi emhpfybpswaj 
oppqrstttvwwyzz 

6.1 9.6 2.8 8.8 8.5 
8.5 9.6 6.7 8.8 2.8 


Want t0 l00k at the two comparison functions. Pointers to void are used 

fullv pvniai S Jf reqaired by lhe function prototype of qsort() in stdlib.h. We will care- 
ully explain how these pointers get used in the comparison functions. 
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In file compare.c 


#include T, sort.h 


ii 


int compare_fractional_part(const void *vp, const void *vq) 

{ 

const float *p = vp, *q = vq; 
f1 oat x; 


} 


x = fractional_part(*p) 
return ((x < 0.0) ? -1 


- fractional_part(*q) 
(x == 0.0) ? 0 : +1) 


int lexico(const void *vp, const void *vq) 

{ 

const char *p = vp, *q = vq; 
return (*p - 

} 


p = vp, 

*q); 




l- , > 



Dissection of the compare_fractional_part() Function 

bi int compare_fractional_part(const void *vp, const void *vq) 

{ 


This function takes two const qualified pointers to void as arguments and returns an 
i nt. Because of this, the function can be passed as an argument to qsort(). 

a int compare_fractional_part(const void *vp, const void *vq) 

j 

= vp, *q = vq; 


const float 
float 


ii 

x: 


• < ■> * 



- - *T, 


mm 


The letter v in vp and vq is mnemonic for “void.” Because pointers to void cannot be 
dereferenced, we declare p and q to be pointers to f 1 oat and initialize them with vp 
and vq, respectively. Because an ANSI C compiler will complain if a const-qualified 
pointer is assigned to one that is not const qualified, we declare p and q to be const 
qualified. Notice that we did not declare x to be const qualified. If we had done so, we 
would be able to give x a value only by initializing it. 
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n x = fractional_part(*p) - fractional_part(*q); 
return ((x < 0.0) ? -1 : (x == 0.0) ? 0 : +1); 

The difference of the fractional parts of the objects pointed to by p and q is assigned to 
x. Then -1, 0, or +1 is returned, depending on whether x is negative, zero, or positive. 
Thus, when we call qsort() with compare_decima1_part () passed as an argument, 
the elements in the array get sorted according to their fractional parts. 

■ : □ ■ 1 

Observe that in the function 1 exi co() we defined p and q to be pointers to const 
char and initialized them with vp and vq, respectively. Then we returned the difference 
of what is pointed to by p and q. Thus, when we call qsort () with 1 exi co passed as an 
argument, the elements in the array get sorted lexicographically. 


8.7 The Macros in stdio.h and ctype.h 

The C system provides the macros getc() and putc() in stdio.h. The first is used to 
read a character from a file, the second to write a character to a file. (See Section 11.5, 
“An Example: Double Spacing a File,” on page 509.) Because the header file stdio.h con¬ 
tains the lines 

#define getchar() getc(stdin) 

#define putchar(c) putc((c), stdout) 

we see that getchar() and putcharO are also macros. They read characters from the 
keyboard and write characters to the screen, respectively. - 
The C system also provides the standard header file ctype.h, which contains a set of 
macros that test characters and a set of prototypes of functions that convert charac¬ 
ters. The preprocessing directive 

#iinclude <ctype.h> 

includes these macros and function prototypes. In the table that follows, we list the 
macros that test characters. These macros all take an argument of type i nt and return 
an i nt. 



Macro 

Nonzero (true) is returned if: 

isalpha(c) 

c is a letter 

isupper(c) 

c is an uppercase letter 

islower(c) 

c is a lowercase letter 

isdigit(c) 

c is a digit 

isalnum(c) 

c is a letter or digit 

isxdigit(c) 

c is a hexadecimal digit 

isspace(c) 

c is a white space character 

ispunct(c) 

c is a punctuation character 

isprint(c) 

c is a printable character 

isgraph(c) 

c is printable, but not a space 

iscntrl(c) 

c is a control character 

i sasci i (c) 

c is an ASCII code 


In the next table, we list the functions toupperQ and tolower(), which are in the 
standard library, and the macro toasci i (). The macro and the prototypes for the two 
functions are in ctype.h. The functions and the macro each take an i nt and return an 
i nt. In the table, we assume that c is a variable of integral type, such as char or i nt. 
Note carefully that the value of c stored in memory does not get changed. 


Call to the function or macro 

Value returned 

toupper(c) 

corresponding uppercase value or c 

tolower(c) 

corresponding lowercase value or c 

toascii(c) 

corresponding ASCII value 


If c is not a lowercase letter, then the value returned by toupper(c) is c. Similarly, if c 
is not an uppercase letter, then the value returned by tol ower (c) is c. 



















384 Chapter 8 y The Preprocessor 


8.8 Conditional Compilation 

H 1 l P H ePr i CeSS ° r haS directives for conditional compilation. They can be used for pro- 

t0 another 'Each nrem C ° de that is more P^able from one machine 

to anomer, hacJi preprocessing directive of the form 

#4 4 = 

...T. _p constantJntegraLexpression 

it i r a e r iden tifie r 

#i fndef identifier 

directive ^ C ° nditi ° nal COmpiJation of tlie ™de that follows until the preprocessing 
#endif 

rnu<n hp ed F ° r th ! interWning C0de t0 be implied, after #i f the constant expression 

* “ and aft6r #1fd6f ° r after #if defined, the named i S 

must have been defined previously in a #defi ne line, without an intervening 

#u n de f identifier 

™iy rmdelSed 0 ma<mAftCT #ifndef ’ the named identlto nmst be 

thJsizeof^onemtofn! 1 ! eXpre ® slon used in a Preprocessing directive cannot contain 

operator is available in AMSTr^^ the def1 ned preprocessin 8 operator. This 

P available m ANSI C, but not necessarily in traditional C. The expression 

defi ned identifier is equivalent to def i ned (identifier) 

i defined ' “• evaiuates * ° h«c 

#if defined (HP9000) || defined(SUN4) && !defined(VAX) 

#endif macblne ~dependent code */ 

theXTa ffle'lle w r ” e Sta,emen,S ” useful for debl «ging purposes. Suppose that at 


#define DEBUG 1 
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and then throughout the rest of the file we write lines such as 
#if DEBUG 

printf("debug: a = %d\n", a); 

#endif 

Because the symbolic constant DEBUG has nonzero value, the pri ntf () statements will 
be compiled. Later, these lines can be omitted from compilation by changing the value 
of the symbolic constant DEBUG to 0. 

An alternate scheme is to define a symbolic constant having no value. Suppose that 
at the top of a file we write 

#define DEBUG 

Then we can use the #ifdef or #i f defi ned forms of conditional compilation. For 
example, if we write 

#ifdef DEBUG 

* « « * » 

#endif 

then the intervening lines of code will be compiled. When we remove the #def i ne line 
that defines DEBUG from the top of the file, the intervening lines of code will not be 
compiled. 

Suppose we are writing code in a large software project. We may be expected to 
include at the top of all our code certain header files supplied by others. Our code may 
depend on some of the function prototypes and on some of the macros in these header 
files, but because the header files are for the project as a whole, our code might not use 
everything. Moreover, we may not even know all the things that eventually will be in the 
header files. To prevent the clash of macro names, we can use the #undef facility: 

#inc!ude "everything.h" 

#undef PIE 

fdefine PIE "I like apple." 

« i • ♦ t 

If PIE happens to be defined in everything.h, then we have undefined it. If it is not 
defined in everything.h, then the #undef directive has no effect. 

Here is a common use of conditional compilation. Imagine that you are in the testing 
phase of program development and that your code has the form 
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statements 

more statements 

and still more statements 

o^SS::Sr’ y0U may WiSh l0 tem P° raril y disregard, or bl 

y code. To do this, you can try to put the code into a comment. 

statements 

/* 

more statements 

V 

and still more statements 

However, if the code to be blocked out contains comments this method will re<n,n h 
y cue error, i ne use of conditional compilation solves this problem. 

statements 
#if 0 

more statements 
#endif 

and still more statements 

C EachTf^T-Tr 1198 C ° ntro1 structures that are similar to the if-el se statement 
e. Each ot the #if forms can be follow^ 1 ™ C1 . iUremcri t 

preprocessing directives of ,helm, V " Umber ° f lmes ' »° ssM y 

#el i f constants integrals expression 

possibly followed by the preprocessing directive 
#else 

and, finally, followed by the preprocessing directive 
#endif 

Note that #el i f is a contraction for "else-if." The flow of control for rnnHit™.. 
lation ,s analogous to that provided by i f-el se statements * t, ° nal comp ‘ 
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8,9 The Predefined Macros 

In ANSI C, there are five predefined macros. They are always available, and they cannot 
be undefined by the programmer. Each of these macro names includes two leading and 
two trailing underscore characters. 






Predefined macro 


.DATE__ 


_LINE. 


_STDC. 


TIME. 


Value 


A string containing the current date 
A string containing the file name 


An integer representing the current line n um ber 


If the implementation follows ANSI Standard C, 
then the value is a nonzero integer. 


A string containing the current time 


8.10 The Operators # and ## 


mm fil sH 


The preprocessing operators # and ## are available in ANSI C but not in traditional C. 
The unary operator # causes “stringization” of a formal parameter in a macro defini¬ 
tion. Here is an example of its use: 

#define message_for (a, b) \ 

printf(#a " and " #b We love you!\n") 


int main(void) 

{ 

message_for(Carole, Debra); 
return 0; 

} 

When the macro is invoked, each parameter in the macro definition is replaced by its 
corresponding argument, with the # causing the argument to be surrounded by double 
quotes. Thus, after the preprocessor pass, we obtain 
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int main(void) 

{ 

printf("Carole" " and " "Debra" We love you!\n"); 
return 0; 

} 

Because string constants separated by white space are concatenated, this pri ntf () 
statement is equivalent to 

printf("Carole and Debra; We love you!\n"); 

In the next section we use the “stringization” operator # in the assertO macro. 

The binary operator ## is used to merge tokens. Here is an example of how the oper¬ 
ator is used: 

fdefine X(i) x ## i 
X(l) = X(2) = X(3); 

After the preprocessor pass, we are left with the line 


8.11 The assertO Macro 

ANSI C provides the assertO macro in the standard header file assert.h. This macrc 
can be used to ensure that the value of an expression is what you expect it to be. Sup 
pose that you are writing a critical function and that you want to be sure the arguments 
satisfy certain conditions. Here is an example of how assertO can be used to do this: 

#include <assert.h> 

void f(char *p, int n) 

{ 

assert(p != NULL); 
assert(n > 0 && n < 7); 


If an assertion fails, then the system will print out a message and abort the program. 
Although the assertO macro is implemented differently on each system, its general 
behavior is always the same. Here is one way the macro might be written: 
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#include <stdio.h> 

#include <stdlib.h> /* for abort() */ 


#if defined(NDEBUC) 

#define assert(ignore) ((void) 0) 


c* o 

frC I 

#define assert(expr) \ 

if (!(expr)) { \ 

printf("\n%s%s\n%s%s\n%s%d\n\n", \ 

"Assertion failed: ", #expr, \ 

"in file " FT I F \ 

9 If J lnlllt juyf I m mm Iwww . M fcwwwM v .. v . f ^ ^ 

"at line ",_LJNE_) » \ 

abort(); \ 

} 

#endif 


/* ignore it */ 


Note that if the macro NDEBUGQ is defined, then all assertions are ignored. This allows 
the programmer to use assertions freely during program development, and then to 
effectively discard them later by defining the macro NDEBUG. The function abort () is in 
the standard library. (See Appendix A, “The Standard Library.”) 


3S8SSE 


8.12 The Use of ferror and fpragma 

ANSI C has added the #error and #pragma preprocessing directives. The following 
code demonstrates how #error can be used: 

#if A_SIZE < B_SIZE 

#error "Incompatible sizes" 

#endif 

If during compilation the preprocessor reaches the #error directive, then a compile- 
time error will occur, and the string following the directive will be printed on the 
screen. In our example, we used the #error macro to enforce the consistency of two 
symbolic constants. In an analogous fashion, the directive can be used to enforce other 
conditions. The #pragma directive is provided for implementation-specific uses. Its gen¬ 
eral form is 

#pragma tokens 

It causes a behavior that depends on the particular C compiler. Any #pragma that is not 
recognized by the compiler is ignored. 
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8-13 Line Numbers 

A preprocessing directive of the form 
#line integraLconstant "filename" 

stTnTanri^nu? Pile t r / 0 f v imber the SOlirce text 80 the next line has the specified con- 
name is ~ !1 be ie T tllat the current S0lirce file name is filename. If no file 
Wc L ! i then only the renu mbenng of lines takes place. Normally, line num- 
t erro C r f dd6n fr ° m the P ro g ram nier and occur only in reference to warnings and syn- 


8.14 Corresponding Functions 

fLt^L C have mr°rL thC ^ parameters that are S^en in the standard header 

wo wl t ii r ?? ” 8 functions in the standard library. As an example, suppose 

to write SS ^ functl0n 1 sa1 P ha O instead of the macro. One way to do this is 

#undef isalpha 

the m m!cro dffiI,tt'i„ f !! e F bef0re i , Salpha ° ls ln ™ tad - Thls effect of discarding 

include the header filter! W * W0Uld Stl11 

macros, ,he fUe b “*»“ *“» “ 

Another way to obtain the function instead of the macro is to write 
(isalpha)(c) 

The preprocessor does not recognize this construct as a macro, but the compiler recog¬ 
nizes it as a function call. (See exercise 8, on page 398.) % 
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8.1 5 An Example: Quicksort 

Quicksort was created by C. Anthony R. Hoare and described in his 1962 paper “Quick¬ 
sort” (Computer Journal, vol. 5, no. 1). Of all the various sorting techniques, quicksort is 
perhaps the most widely used internal sort. An internal sort is one in which all the data 
to be sorted fit entirely within main memory. 

Our quicksort code makes serious use of macros. When sorting lots of data, speed is 
essentia], and the use of macros instead of functions helps to make our code run faster. 
As we will see, all the macros that we use are quite simple. We could replace them by 
inl in e code, but their use makes the code more readable. Because quicksort is impor¬ 
tant, we will explain it in some detail. 

Let us suppose that we want to sort an array of integers of size n. If the values of the 
elements are randomly distributed, then, on average, the number of comparisons done 
by quicksort is proportional to n log n. But in the worst case, the number of compari¬ 
sons is proportional to n 2 . This is a disadvantage of quicksort. Other sorts—mergesort, 
for example—even in the worst case, do work proportional to n log n. However, of all 
the n log n sorting methods known, quicksort is, on average, the fastest by a constant 
factor. Another advantage of quicksort is that it does its work in place. No additional 
work space is needed. 

Our quicksort code is written in a single file. We will describe the elements of the 
code as we present it. At the top of the file we have 


/* Quicksort! Pointer version with macros. */ 


#define 
#define 
#define 
#define 


swap(x, y) 
order(x, y) 
o2(x, y) 
o3(x, y, z) 


{ int t; t = x; x = y; y = t; } 
if (x > y) swapCx, y) 
orderfx, y) 

o2(x, y) ; o2(x, z); o2(y, z) 


typedef enum (yes, no} yes_no 


static yes_no 
static int 


find„pivot(int *left, int * right, int *pivot_ptr); 
*partition(int *1 eft, int ’-right, int pivot); 


We have not written our macros to be robust. They are intended for use only in this file. 
A typedef has been used to make the type yes_.no synonymous with the enumeration 
type enum {yes, no}. Because the two functions find_pivot() and partition() 
have stati c storage class, they are known only in this file. 
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void quicksort(int deft, int *right) 
int *p, pivot; 

i 

if (find_pivot(left, right, &pivot) == yes) { 
p - partition(left, right, pivot); 
quicksort(left, p - 1 ); 
quicksort(p, right); 

} 

} 

Quicksort is usually implemented recursively. The underlying idea is to “divide and 

conquer. Suppose that in mai n() we have declared a to be an array of size N. After the 
array has been filled, we can sort it with the call 

quicksorts, a + N - 1); 

The first argument is a pointer to the first element of the array; the second argument is 
a pointer to the last element of the array. In the function definition for qui cksort(), it 
is convenient to think of these pointers as being on the left and right side of the array 
respectively. The function find_pivot() chooses, if possible, one of the elements of 
e array to be a pivot element.” The function parti tion() is used to rearrange the 
array so that the first part consists of elements all of whose values are less than the 
pivot and the remaining part consists of elements all of whose values are greater than 
or equal to the pivot. In addition, parti ti on 0 returns a pointer to an dement n the 
array. Elements to the left of the pointer all have value less than the pivot, and elements 
to the right oi the pointer, as well as the element pointed to, all have value greater than 

or equal to the pivot. Once the array has been rearranged with respect to the pivot; 
qui cksortQ is invoked on each subarray. 

static yes_.no find_pivot(int *left, int ’’'right, int *pivot_ptr) 
int a, b, c, *p; 

h I , ■ , /* "left value */ 

r ~ + (right - left) / 2); /* middle value */ 

‘ /* right value */ 

ilfl D ’,P’ . /* order these 3 values */ 

l , pivot will be higher of 2 values */ 

'•pivot_ptr = b; 

return yes; 
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static int *partition(int *left, int *right, int pivot) 

{ 

while (left <= right) { 

while ( ,v left .< pivot) 

++left; 

while (*right >= pivot) 

—right; 

if (left < right) { 
swap(*1eft, * right); 

++left; 

—right; 

} 

} 

return left; 


The major work is done by parti ti on (). We want to explain in detail how this func¬ 
tion works. Suppose we have an array a[] of 12 elements: 


When find_pivot() is invoked, the first, middle, and last elements of the array are 
compared. The middle value is 5, and because this is larger than the smallest of the 
three values, this value is chosen for the pivot value. The following table shows the val- 


if (b < c) { 

*pi vot_.pt r = c; 
return yes; 

} 

for (p = left +1; p <= right; ++p) 
if (*p ! = ’-left) { 

*pi vot_.ptr = (*p < *left) ? *left : *p; 
return yes; 

} 

return no; /* all elements have the same value */ 

} 


Ideally, the pivot should be chosen so that at each step the array is partitioned into two 
parts, each with an equal (or nearly equal) number of elements. This would minimize 
the total amount of work performed by qui cksort (). Because we do not know a priori 
what this value should be, we try to select for the pivot the middle value from among 
the first, middle, and last elements of the array. In order for there to be a partition, 
there has to be at least one element that is less than the pivot. If all the elements have 
the same value, a pivot does not exist and no is returned by the function. (See exercise 
25, on page 404 and exercise 26, on page 404, for further discussion.) 
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each Pass °5 the »M 1 e loop in the pa rti - 
e elements that were swapped in that pass are boxed. 

Unordered data: 1 4 3 c -> r Q 

First pass: -3 4 3 ^ ^ 5 8 2 1 9 -6 -3 

Second pass: -3 4 3 _ 6 \ 5 « l } 9-6 7 

Third pass: -3 4 f 5 \ J ? 7 1 9 5 7 

Fourth pass: -3 4 4 3 -1 2 1 f | 5 5 ° 5 5 ? 7 

- ith ^ ° t0 6 have -lue less than the 

address of a [ 7 ] is returned when the t^rto*^*** ^ ^ t0 ^ plVOt - The 


Summary 

1 ““a. Files 
include <filename> #include " filename " 

2 string/rhe g *e a symbobc name t „ a token 

text before compilation. CS e strmg for the sv ' mhol| e nam e in the source 

3 andp«a°S %£££** “ deflM Sy ” b ° liC COns, “' s «■»»» readability 

4 maCr ° faCl “ ty W ‘ th ar8Umem subs '“«'“>»- A 

parameters is defined by a preprocessing directive of the form 

'define idem,f,erf identifier . identifier ) ,oke„_stnng opt 

An example is given by 

#define SQ(X) (( X ) * ( X )) 


This macro provides inline code to perform squaring of a value. 
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5 The preprocessor provides for conditional compilation to aid in program testing, to 
facilitate porting, and so on. Lines beginning with #i f , #i fdef, #i f ndef, #el i f, 
#el se, and #endi f are used for this. 

6 The def i ned operator can be used with preprocessing directives. An example is 

#if (defined(HP3000) || defined(SUN3)) && !defined(SUN4) 

. /* machine-dependent code */ 

#endif 

7 An effective way to block out sections of code for debugging purposes is to use 
#if 0 

#endif 

8 ANSI C has introduced the preprocessing operators # and ##. The # operator is 
unary. It can be applied to a formal parameter in the body of a macro, causing 
replacement text to be surrounded by double quotes. This effect is called “stringi- 
zation.” The ## operator is binary. It causes the pasting together of two tokens. 

9 ANSI C provides the assert O macro in the header file assert.h. Assertions can be 
used to ensure that an expression has an appropriate value. 

10 The function qsortO is provided by the C system. Its function prototype is in 
stdlib.h. Although qsortO is faster than a bubble sort or a simple transposition 
sort, it is often not as fast as quicksort. An advantage of qsortO is that it is easy to 
use. 

11 Quicksort is one of the most widely used sorting algorithms. It is typically faster by 
a constant factor than other well-known n log n sorting methods, such as merge- 
sort. 
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Exercises 

1 that J contains macros with arguments can be difficult to debug Most C 

screen w^nTfurth 11 ° Ptl ° n * at Causes the Preprocessor to write its output on the 
try_mec compilation taking place. Put the following code in a file, say 

#inc!ude <stdio.h> 

#define PRN(x) printf("x\n"); 

int main(void) 

{ 

PRN(Hello from main()); 
return 0: 

} 

expected^To see hnu,°fh p m ^ ^ Y ° U WiU SGe that il does not P rint what was 
expected, l o set how the preprocessor treats this code, give the command 

cc -E fry. me. c 

option ^not theri^t Want f t0 tal<e 3 carefuI look at what S ets produced.) If the -E 
Note thnt thl h ! g , f ° r y0Ur com P iler - find ou t what the correct option is 

why. g6nerated ^ thC ^ P ~ 

2 Consider the following macro definition: 

foreverCmore) eVer ^’ X ' ) fore ™r(foreverCx)) 

This looks like it will produce infinite recursion, but in ANSI C the preprocessor is 

Sdtato™"' 1 ' '° k " OW th “' ‘ nfinlte recursl “ 1* "« "'ha, is 
nenuca. mow does youi preprocessor expand this macro? 

3 have the valm I'T?? ^ var f hles of type f 1 oat in a program. If these variables 

e values 1.1, 2.2, and 3.3, respectively, then the statement 

PRN3(x , y, Z ); 


I 
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should cause the line 

x has value 1.1 and y has value 2.2 and z has value 3.3 
to be printed. Write the macro definition for PRN3Q. 

4 Suppose we have the following code in a file named n.b.c.h. - 

#define TRUE 1 

#define A_B_C int main(void) \ 

{ \ 

printf("A Big Cheery \"hello\"! \n") ; \ 

return 0; \ 

} 

and the following code in a file named a.b.c.c: 

#if TRUE 

#include <stdio.h> 

#include "a_b_c.h" 

A_B_C 

#endif 

When we try to compile the program, the compiler complains. Why? Can you inter¬ 
change two lines in one of the files so that the program compiles and runs? Explain. 

5 Macros are. not always as safe as functions, even when all the parameters in the 
body of the macro definition are enclosed in parentheses. Define a macro 

MAX(x, y, z) 

that produces a value corresponding to the largest of its three arguments. Con¬ 
struct some expressions to use in MAXQ that produce unanticipated results. 

6 Are all of the predefined macros available on your system? Try the following code: 

printf("%s%s\n%s%s\n%s%d\n%s%d\n%s%s\n", 

" DATE.. = ", ..DATE.., 

"..FILE.. = ", ..FILE.., 

"..LINE.. = ", ..LINE.., 

"_STDC_= ",_STDC_, 

" .TIME.. = ", ..TIME..): 
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#include <ctype.h> 
^include <stdio.h> 
#include <time.h> 


int main(void) 

{ 

int c: 


prin tf ("Cl° ck ticks: %ld\n", clockQ) 
while C(c = getcharQ) != EOF) 
if (islower(c)) 
putchar(toupper(c)); 
else if (isupper(c)) 
putchar(tolower(c)); 
else if (isdigit(c)) 


/* start the clock */ 


<time' L h! ' hiS pro8 ';™- (l ;, ead ab out clockQ in Section A.15, “Date and Time: 
t me • h>< on Pa§ e 67j -) If c is a digit, write the character x; if c is a punctuation 


#include <ctype,h> /* function prototype here? */ 
if ((isalpha)Ca')) 

printf ( Found the isalphaO function!\n") ; 

tionTiftpr »?1 T Pri f ed if y0Ur system does not provide the corresponding func- 
tions. After all, they do not get used very much. 


0 you have access to an ANSI C compiler on a small system? On small systems 
compi ers such as Borland C and Microsoft C provide for different memory models 
and each memory model usually requires a specific type definition for ptrdiff t 

whv thl S Iif ef ' h 3nd S6e lf this is lhe case on V° ur small system. Can you explain 
y the different memory models require their own ptrdi ff_t type? 


8 In ANSI C, many of the macros with arguments defined in the standard header files 
are required to be available also as functions. Does your system provide these func¬ 
tions. See, tor example, if your system will accept the code 


ro . e rep ut atl °n for being an excellent language for character processing. This 
putation is due, m part, to the fact that macros rather than functions are used 
extensively m character processing. Programmers believe that the use of macros 
t Ca T red T :e execution time significantly. Is this really true? In this exercise, we want 

o test tins belief. Begin by writing a program that uses the macros in stdio h and 
ctype.h extensively. 
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character, do not write anything; if c is a white space character, write it twice. Just 
before the return from mai n(), write the line 

printfC'Clock ticks: %ld\n", clockQ); /* ticks up to now */ 

Now, write another version of your program in which each macro is replaced by its 
corresponding function. For example, Instead of islower(c) write (islower) (c). 
Use redirection to process character files with each version of your program. On a 
small file, due to system overhead, there should not be much difference in running 
time, but as the file gets larger, the difference should be more pronounced. Is it? 
Hint: Give the command 


pgm < input > output 

so that you will not have to waste time printing the file on the screen. 


In ANSI C, the standard header files can be included repeatedly and in any order. 
Change the first version of the program that you wrote in the previous exercise by 
duplicating the #i ncl ude lines at the top of the file a number of times: 


#include 
#include 
#1nclude 
#include 
#include 
#include 


<ctype.h> 
<stdio.h> 
ctime.h> 
<ctype.h> 
<stdio.h> 
<time.h> 


. /* repeat a number of times */ 

int main(void) 

{ 

* M W # * 


Does your compiler complain? Although the program might compile a little more 
slowly, it should execute just as fast. Does it? 

We listed i sasci i () as a macro in ctype.h. However, this macro is not really speci¬ 
fied in the ANSI C documents. (Perhaps the ANSI C committee did not want to show 
any favoritism for ASCII codes over any other.) Check to see if i sasci i () is pro¬ 
vided by your system. 
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S®&wl«5es*tv «ww!bi 
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12 In this exercise, we want to warn you about a subtle difference between traditiotti 
C and ANSI C. In traditional C, tolowerC) and toupper() are provided as macro 
in ctype.h. In ANSI C, the corresponding macros are available in ctype.h, but thi 
have been renamed as _tol ower() and _toupper(). (Check to see that they £# 
available on your system.) In traditional C, it makes sense to use an expression sucjjp 
as toupper(c) only if you already know that c is a lowercase letter. In ANSI cf 
toupperf) is implemented as a function, and the expression toupper(c) makjffc 
sense no matter what integral value c might have. If c is not a lowercase letter, then 
toupper(c) has no effect; that is, it returns c. Similar remarks hold with respect to? 
tol ower (). Experiment with your system. See what is produced by the expression^ 
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tolowerC a') 


.tolower('a’) 


toupper('A') 


A\v 

.toupperf'A') s 


13 The stringization operator # causes an argument that is passed to a macro to bft 
surrounded by double quotes. What happens if the argument is already surrounded g 
by double-quote characters? Write a test program that contains the following codejlr 

? pj|f 

fdefine YANK(x) s = #x § 

char *s; f 

YANK("Go home, Yankee!"); i» 

printf("%s\n", s); 



16 


■ ' I 


Write another version of your program that does not contain the # operator. Exe- 

cute both versions. How does the output of one version differ from the other? Use ? 

the -E option (or whatever is required by your system) to get the preprocessor toll 

expand your code. What does the preprocessor do differently when the # operator 1§ 
is present? 



. JHL 

W-A . ‘ ^.(.r-MviS . 


MBBl SB! 


14 What gets printed? Explain. 

#define GREETINGS(a, b, c) \ 

printf(#a ", " #b ", and " #c " 



Hello!\n") 


int main(void) 

{ 

GREETINGS(A1ice, Bob, Carole); 
return 0; 

} 


i £%"■ (/- v. '■. • -T' 

v,J, r\ ■-, • jjv&l 
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Hjapjag; ^agggHP 
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Look what is 
GREETINGS? 


is produced by the preprocessor before compilation. Can you find 
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Consider the directive 
#undef TRY_ME 

If TRY_ME was previously defined with a #def i ne macro, this line causes the macro 
to be discarded. If TRY_ME was not previously defined, then the line should have no 
effect. Write some code to test what happens on your system. If TRY_ME was not 
previously defined, does your system complain? 

The assert () macro is supposed to be discarded if the macro NDEBUG is defined. 
Does it work as expected on your system? Try the following program: 

#define NDEBUG 

#include <assert.h> 

int main(void) 

{ 

int a = 1, b = 2; 

assert(a > b); 
return 0; 

} 

What happens if you interchange the first two lines of the program? 

In the program in the previous exercise, replace the two lines at the top with the fol¬ 
lowing three lines: 

#include <assert.h> 

#define NDEBUG 
#include <assert.h> 

Does your C compiler complain? Should your C compiler complain? Hint: Check to 
see if the line 

#undef assert 


is in the assert.h header file. 
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to nnl rVi h > f° Vmg 3 large c P ro e ram from a machine that has 4-byte words* 
ie that has 2-byte words. On a machine with 2-byte words, an i nt is restricted 

to values that he (approximately) between -32,000 and +32,000. Suppose that this! 

range o values is ,„„ restrictive for some parts of the program thafyou re mt 
mg. If you put the line uv . 

#define int 1onq J 

'll 

* nt ,? rt deade | Ede *^ a * S G,S included with each of the files making up your program •■ 
will this work? Explain. i y piogram, 

m 

1 thhf t t ^AH y, ? def f J!l t 1 Z , e_t in the header me stdde f- h on your system. Search for - 
yp e in stdhb.h also. Suppose you are writing some code that starts with ‘S|§ 

^include <stddef.h> M 

#include <stdlib.h> 

Sgjji 
' X”^' v 

vent tE “ f type definitions do not work, your system must be able to pre-ll 
nism worfe P6def f ° r 51Ze_t fr ° m bdng included twice ‘ Ex P lain h °w this mecha-fj 

“11 

20 f n ™° P \^V° USe qS0rt °' you have t0 know what its function prototype is. On'^l 
some ANSI C systems it is given in stdlib.h as J| 

». “v. 

void qsort(void *base, size_t neiem, size_t width, 1 

int (*compare)(const void *, const void *)); 

What is provided by your system? Is it equivalent to tills? Remember, in a function ( 

Hk>nr°f! 1 n C, 1 ' he Parameter ldenUfiers are discarded by the compiler. Thus, an equiv- ' 
alent iunction prototype is given by 


void qsort(void *, size_t, size_t, int (*)()); 


21 In the qsort program, we used two comparison functions. What happens if vou 
rewrite the comparison functions in such a way that they do not match the last 

parameter in the function prototype for qsort()? Rewrite them as 

int compare_decimal_part(f 1 oat *p, float * q ) 
float x; 

x = decimal_part(*p) - decimal_part(*q); 
return ((x == 0 . 0 ) ? 0 ; (x < 0 . 0) ? V: +1) . 
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int lexico(char *p, char *q) 

{ 

return (*p - *q); 

} 

Also, change the corresponding function prototypes in mainQ. Will the program 
compile now? If it does, take a careful look at the function prototype for qsort () 
as given in stdlib.h on your system. Is it the case that the last parameter is some¬ 
thing like (compare *) ()? If the program does not compile, can you cast the last 
argument in the two calls to qsort() so that it does? 

Write a program to test the quicksort code. Begin with the following code: 

#include <stdio.h> 

#include <stdlib.h> 

#include <time.h> 

#define N 10000 

void quicksort(int *, int *); 

int main(void) 

{ 

int a[N], i ; 

srand(time(NULL)); /* seed the random number generator */ 

for (i = 0 ; i < N; ++i) 
a[i] = rand() % 1000 ; 
quicksorts, a + N - 1); 


Complete this program by writing code that prints the sorted array elements. If you 
get tired of looking at all those elements, you can print just enough of the begin¬ 
ning, middle, and end of the array so that you believe it has been sorted. 

In the previous exercise, you sorted an array of size 10,000 with integer entries that 
were randomly distributed in the interval [0, 999], Run the program again, this time 
keeping track of its running time. Now 7 change the program so that the entries in 
the array are randomly distributed in the interval [0, 9], Give a heuristic argument 
to explain why the times are so different. 

The quicksort algorithm can be used to sort ail kinds of arrays, not just arrays of 
integers. Rewrite the quicksort code so that it can be used to sort an array of 
strings. Write a program to test your code. 
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25 U an array has just a few elements, say seven or less, then a bubble sort or a trans 
position sort should be faster than quicksort. The following version of quick 
sort() takes this into account: 

int quicksort(int *1eft, int *right) 

{ 

int *p, *q, pivot; 

static int cnt = 0; 

if (right - left < 7) { 

for (p = left; p < right; ++p) 

for (q = p + 1; q <= right; ++q) 
if (*p > *q) 
swap(*p, *q); 

} 

else if (find_pivot(1 eft, right, Apivot) == yes) { 
p = partition(left, right, pivot); 
quicksort(left, p - 1); 
quicksort^, right); 

} 

return ++cnt; 


Note the use of the variable cnt. The value returned to the calling environment is 
the number of times that the function gets called. Experiment to see if this new ver¬ 
sion of quicksortO executes faster. Is there a correlation between running time 
and the number of times the function gets called? 

26 Having a good algorithm for finding the pivot element in quicksort can be crucial. A 
simple algorithm is to find two distinct elements of the array and to choose the 
larger value as the pivot. With this algorithm, quicksort takes time proportional to 
n instead of n log n if the array happens to be already in order, or nearly so. This is 
an important point, because, in practice, arrays are often partially in order to begin 
with. Rewrite the quicksort code to implement this algori thm and write a test pro¬ 
gram that illustrates the poor behavior of quicksort when the array is in order to 
begin with. What happens if the array starts in reverse order? 

27 To find the pivot element, we choose from among three elements of the array. 
Choosing from among five elements should reduce the running time somewhat. 
Implement this strategy by rewriting the quicksort code, using macros where 
appropriate. Write a program to test whether the running time is less. The optimal 
strategy for finding the pivot depends on the data in the array. Choosing from 
among live elements is a common strategy. What about choosing from among 
seven elements? If you are ambitious, try that strategy as well. 
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8 Suppose that a[] is an array of integers of size 100, and that for each i the element 
a [i ] has value i. If qui cksort (a, a + 99) is invoked, how many function calls to 
quicksortO are made? Compute this number for each version of find_pivot(). 

9 The pointer that is returned by parti ti on () is used to break the original array into 
two subarrays. The size of the first subarray is called the partition break size. Use a 
random-number generator to fill an array of size 100. Invoke find_pivot() and 
parti ti on () to find the partition break size for the array. Do this repeatedly, say 
100 times, and keep track of the running average of the break size. One expects the 
average break size to correspond to the middle of the array. Does this seem to be 
true from your experimentation? 

30 The optimal break size for an array of size n is n/2. This identifies two subarrays of 
equal, or nearly equal, size for further processing. For example, given an array of 
size 100, a break size of 50 is optimal. Notice that a break size of 49 identifies sub- 
arrays of sizes 49 and 51, and that a break size of 51 identifies subarrays of sizes 
51 and 49. Thus, the break sizes 49 and 51 are both of equal merit. Modify the pro¬ 
gram you wrote in exercise 27, on page 404, so that you keep track of the running 
average of the absolute value of the difference k - 50, where k is the break size 
obtained from parti ti on (). This number corresponds inversely to how good the 
break size is. More generally, define 

\k - (»/2)| 


where k is the partition break size obtained from parti ti on () acting on an array 
of size n. Fill arrays randomly and run some machine experiments to see what, if 
anything, can be said about m. 

31 In the discussion that follows the code for the parti ti on () function, we presented 
a table that shows the elements of the array a [] after each pass of the outer whi 1 e 
loop. Find an array of 12 distinct elements that will produce the maximum number 
of passes. Write down, a table that shows the elements of the array after each pass. 
Box the elements that were swapped. After you have done this by hand, write a pro¬ 
gram to create this table. To indicate the elements that were swapped in each pass, 
you can surround them with double quotes. 

32 Compare quicksortO with mergesort(). (See Section 6.9, “An Example: Merge 
and Merge Sort,” on page 266.) Time both functions with arrays having 100, 1,000, 
and 10,000 elements. For a small amount of data, run-time overhead dominates. 
That is, setting up the functions, initializing values, and other miscellaneous steps 
dominate the actual work required to perform the sorting. For large arrays, qui ck¬ 
sort () should be faster than mergesortQ. is this true? 
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33 If you have a small amount of data to be sorted, a bubble sort or transposition sort 
works fine. If you have more data, then qsortO can be used. Although qsort() 
certainly is a lot faster than a bubble sort, for most implementations it is not as fast 
as qui cksortO. The advantage of qsortO over quicksortO is that it can be 
implemented with just a few lines of code. The advantage of quicksortO over 
qsortO is that it is often faster. By how much? On our system, the answer is that it 
is a lot faster. Write a program to test this. In your program, declare two large 
arrays. Fill the first array with randomly distributed integers and copy it into the 
second array. Time how long it takes qsortO to sort the first array and how long it 
takes qui cksortO to sort the second array. Print these two values and their quo¬ 
tient. Because of system overhead, the size of the quotient will depend on the size 
of the arrays being sorted. It will also depend on how much effort you put into fine- 
tuning your quicksort algorithm. In any case, remember what quotients you get (or 
jot them down in the margin). 

34 In the previous exercise, you computed the quotient of running times required to 
sort a large array, first with qsortO and then with quicksortO. In this exercise, 
you are to compute those quotients again, first with your array elements randomly 
distributed in the interval [0, 100000] and then with array elements randomly dis¬ 
tributed in the interval [0, X]. The results of this exercise can be quite surprising. 
Try it. 



C is an easily extensible language. It can be extended by providing macros that are 
stored in header files and by providing functions that arc stored in libraries. It can also 
be extended by defining data types that are constructed from the fundamental types. 
An array type is an example of this; it is a derived type that is used to represent homo¬ 
geneous data. In contrast, the structure type is used to represent heterogeneous data. A 
structure has components, called members, that are individually named. Because the 
members of a structure can be of various types, the programmer can create aggregates 
of data that are suitable for a particular application. 


Structures 


The structure mechanism provides a means to aggregate variables of different types. 
As a simple example, let us define a structure that describes a playing card. The spots 
on a card that represent its numeric value are called “pips.” A playing card such as the 
three of spades has a pip value, 3, and a suit value, spades. We can declare the structure 
type 

struct card { 
int pips; 

char suit; 

}; 

to capture the information needed to represent a playing card. In this declaration, 
struct is a keyword, card is the structure tag name, and the variables pi ps and suit 
are members of the structure. The variable pi ps will take values from 1 to 13, repre- 
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Note that in the type definition the identifier card is used twice. In C, the name space 
for tags is separate from that of other identifiers. Thus, the type definition for card is 
appropriate. 

Within a given structure, the member names must be unique. However, members in 
different structures are allowed to have the same name. This does not create confusion 
because a member is always accessed through a structure identifier or expression. Con¬ 
sider the following code: 

struct fruit { 
char *name; 
int calories; 

}; 

struct vegetable { 
char *name; 
int calories; 

}; 

struct fruit a; 

struct vegetable b; 

Having made these declarations, it is clear that we can access a. cal ori es and b. calo- 
ries without ambiguity. 

Structures can be complicated. They can contain members that are themselves arrays 
or structures. Also, we can have arrays of structures. Before presenting some examples, 
let us give the syntax of a structure declaration: 

structure_declaration ::= struct ^specifier declaratorJist ; 

struct^specifier : := struct tag_name 

| struct tag_name opt { { member^declaration } 1+ } 

tagjname ::= identifier 

member^declaration ::= type_specifier declarator Jist ; 
declaratorjist declarator { , declarator } 0+ 

An example is 

struct card { 
int pips; 

char suit; 

} deck[52]; 


Here, the identifier deck is declared to be an array of struct card. 
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If a tag name is not supplied, then the structure type camiot be used in later declara¬ 
tions. An example is 

struct { 

int day, month, year; 

char day_name[4]; 
char month_name[4]; 

> yesterday, today, tomorrow; 

This declares yesterday, today, and tomorrow to represent three dates. Because a tag 
name is not present, more variables of this type cannot be declared later. In contrast, 
the declaration 

struct date { 

int day, month, year; 

char day_name[4]; /* Mon, Tue, Wed, etc. */ 

char month_name[4]; /* Jan, Feb,'Mar, etc. */ 

}; 

has date as a structure tag name, but no variables are declared of this type. We think of 
this as a template. We can write 

struct date yesterday, today, tomorrow; 

> * 

to declare variables of this type. 

It is usually good programming practice to associate a tag name with a structure 
type. It is both convenient for further declarations and for documentation. However, 
when using typedef to name a structure type, the tag name may be unimportant. An 
example is 

typedef struct { 
float re; 
float im; 

} complex; 

complex a, b, c[100]; 

The type complex now serves in place of the structure type. The programmer achieves 
a high degree of modularity and portability by using typedef to name such derived 
types and by storing them in header files. 
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9.2 Accessing Members of a Structure 


In this section we discuss methods for accessing members of a structure. We have 
already seen the use of the member access operator We will give further examples 
of its use and introduce the member access operator ->. 

Suppose we are writing a program called classJnfo, which generates information 
about a class of 100 students. We begin by creating a header file. 


In file class Jnfo.h 


#define CLASS SIZE 100 


struct student { 

char *last_name; 
int student_id; 
char grade; 

}; 


This header file can now be used to share information with the modules making up the 
program. Suppose in another file we write 


#include "class_info.h" 


int main(void) 

{ 

struct student tmp, class[CLASS_SIZE]; 


We can assign values to the members of the structure variable tmp by using statements 
such as 


tmp.grade = 'A*; 
tmp.last_name = "Casanova"; 
tmp.student_id = 910017; 


Now suppose we want to count the number of failing students in a given class. To do 
this, we write a function named f ai 1 () that counts the number of F grades in the array 
class []. The grade member of each element in the array of structures must be 












412 Chapter 9 Y Structures and Unions 


/* Count the failing grades. */ 

#include "class_info.h" 

int fail (struct student cl ass j']) 

{ 

int i, cnt = 0; 

for (i =0; i < CLASS_SIZE; +i 
cnt += cl ass[i].grade == 'F 
return cnt: 



Dissection of the fail () Function 

■ int fail(struct student class[]) 


i nt 


i, cnt 


The parameter cl ass is of type pointer to struct student. An equivalent declaration 
for tills parameter would be 

v * 

struct student *class 

We can think of class as a one-dimensional array of structures. Parameters of any 
type, including structure types, can be used in headers to function definitions, 

■ for (i = 0; i < CLASS_SIZE; ++i) 

We are assuming that when this function is called, an array of ty-pe struct student of 
size CLASS_SIZE will be passed as an argument. 


cnt += class[i].grade 


* i 


An expression such as this demonstrates how concise C can be. C is operator rich. To 

be fluent in its use, the programmer must be careful about precedence and associativ¬ 
ity. This statement is equivalent to 


cnt += (((class[i]).grade) == 'F'); 
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■pt The member grade of the ith element (counting from zero) of the array of structures 
cl ass is selected. A test is made to see if it is equal to ' F'. If equality holds, then the 
9 2 value of the expression 


class[i].grade == 'F’ 

Is 1, and the value of cnt is incremented. If equality does not hold, then the value of the 
expression is 0, and the value of cnt remains unchanged. 

■ return cnt; 

The number of failing grades is returned to the calling environment. 




C provides the member access operator -> to access the members of a structure via a 
pointer. (This operator is typed on the keyboard as a minus sign followed by a greater 
than sign.) If a pointer variable is assigned the address of a structure, then a member of 
the structure can be accessed by a construct of the form 

pointer_to_structure -> member_name 
A construct that is equivalent to this is 

(* pointer_to_structure ). member_narm 

The parentheses are necessary'. Along with () and [], the operators and -> have the 
highest precedence and associate from left to right. Thus, the preceding construct with¬ 
out parentheses would be equivalent to 

* (pointer Jo.structure. member_name) 

This is an error because only a structure can be used with the operator, not a pointer 
to a structure. 

Let us illustrate the use of -> by writing a function that adds complex numbers. First, 
suppose we have the following typedef in a header file: 

In file complex.h 


struct complex { 
double re; 
double im; 

}; 


/* real part */ 
/* imag part */ 
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typedef struct complex complex; 
Then in another file we write 

In file 2_adclc 


#include "complex.h" 

void add(complex *a, complex *b, 

{ 

a —> re = b —> re + c ~> re; 
a -> im - b -> im + c -> im; 


complex *c) /* a = b + c ■'/ 


Note that a, b, and c are pointers to structures. If we think of the pointers as represent¬ 
ing complex numbers, then we are adding b and c and putting the result into a. That is 
the sense of the comment that follows the header to the function definition. 

In the following table, we have illustrated some of the ways that the two member 
access operatois can be used. In the table, we assume that the user-defined type 
struct student, which we presented earlier, has already been declared. 


i Declarations and assignments 

struct student tmp, *p = &trnp; 
tmp.grade = 'A'; 
tmp.last_name = "Casanova"; 
tmp.student_id = 910017; 

Expression 

Equivalent expression 

Conceptual value 

tmp.grade 

p -> grade 

A. 

tmp.last_name 

p -> last_name 

Casanova 

(*p).student_id 

p --> student......! d 

910017 

* p -> last_name + 1 

(*(p -> last_name)) + 1 

D 

* Cp -> last_name + 2) 

(p -> last_name)[2] 

s 



9.3 Operator Precedence and Associativity: 

A Final Look 

We now want to display the entire precedence and associativity table for all the C oper¬ 
ators. The operators “. ” and -> have been introduced in this chapter. These operators, 
together with () and [], have the highest precedence. 


uperatoi s 

Associativity 

() [] . -> ++ C postfix ) — (.postfix) 

left to right 

++ ( prefix ) — ( prefix. ) ! - sizeof (type) 

4* Cunary) - Cunary) & Caddress) * (dereference) 

rkihf tn 1 

i tv lvl V 

* / % 

left to right 

+ 

left to right 


] pk f'f T A 1*1 0-K t 

II 

A 

A 

II 

V 

V 

M*. k Ay .4*,. .4*4 -k .A. 

== ! ™ 

left to right 

& 

left to right 

A 

left to right 

l 

left to right 

&& 

left to right 

11 

left to right 

?: 

right to left 

= %= »= «= Si- A= | = 

right to left 

, Ccomma operator) 

left to right 


Although the complete table of operators is extensive, some simple rules apply. The 
primary operators are function parentheses, subscripting, and the two member access 
operators. These four operators are of highest precedence. Unary operators come next, 
followed by the arithmetic operators. Arithmetic operators follow the usual convention; 
namely, multiplicative operators have higher precedence than additive operators. 
Assignments of all kinds are of lowest precedence, with the exception of the still more 
lowly comma operator. If a programmer does not know the rules of precedence and 
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associativity in a particular situation, he or she should either look up the rules or use 
parentheses. If you work at a particular location on a regular basis, you should consider 
copying this table and pasting it on the wall right next to where you work. 



9.4 Using Structures with Functions 


In C, structures can be passed as arguments to functions and can be returned from 
them. When a structure is passed as an argument to a function, it is passed by value, 
meaning that a local copy is made for use in the body of the function. If a member of 
the structure is an array, then the array gets copied as well. If the structure has many 
members, or members that are large arrays, then passing the structure as an argument 
can be relatively inefficient. For most applications, an alternate scheme is to write func¬ 
tions that take an address of the structure as an argument instead. 

Business applications often use structures that have lots of members. Let us imagine 
that we have such a structure: 


typedef struct { 
char 
int 

struct dept 

struct home_address 

double 

* < « I B 

} employee_data; 


name[25]; 
employee_id; 
department; 
*a_ptr; 
salary; 


Notice that the department member is itself a structure. Because the compiler has to 

know the size of each member, the declaration for struct dept has to come first. Let 
us suppose that it was given by 

struct dept { 

char dept_name[25]; 
int dept_no; 

} i 


Notice that the a_ptr member in the type definition of empl oyee_data is a pointer to a 

structure. Because the compiler already knows the size of a pointer, this structure need 
not be defined first. 

Now, suppose we want to write a function to update employee information. There are 
two ways we can proceed. The first way is as follows: 



emp!oyee_data update(employee_data e) 

{ 


printf("Input the department number: "); 
scanf("%d", &n); 
e.department.dept_no = n; 

return e; 

} 

Notice that we are accessing a member of a structure within a structure, because 

e.department.dept_no is equivalent to (e.department).dept_no 
To use the function update (), w r e could write in mai n() or in some other function 

employee_data e; 
e = update(e); 

Here, e is being passed by value, causing a local copy of e to be used in the body of the 
function; when a structure is returned from update(), it is assigned to e, causing a 
member-by-member copy to be performed. Because the structure is large, the compiler 
must do a lot of copy work. An alternate way to proceed is to write 

void update(employee_data *p) 

{ 


printf("Input the department number: "); 
scanf("%d", &n); 
p -> department.dept_.no = n; 


In this example, the construct 

p -> department.dept_.no is equivalent to (p -> department) .dept_.no 

This illustrates how a member of a structure within a structure can be accessed via a 
pointer. To use this version of the update () function, we could write in main() or in 
some other function 

employee_data e; 
update(&e); 

Here, the address of e is being passed, so no local copy of the structure is needed 
within the update () function. For most applications this is the more efficient of the 
two methods. (See exercise 5, on page 438, for further discussion.) 











418 Chapter 9 ▼ Structures and Unions 



9.5 Initialization of Structures 


All external and static variables, including structures, that are not explicitly initialized 
by the programmer are automatically initialized by the system to zero. In traditional C, 
only external and static variables can be initialized. ANSI C allows automatic variables, 
including structures, to be initialized as well. 

The syntax for initializing structures is similar to that for initializing arrays. A struc¬ 
ture variable in a declaration can be initialized by following it with an equal sign and a 
list of constants contained within braces. If not enough values are used to assign all the 
members of the structure, the remaining members are assigned the value zero by 
default. Some examples are 

card c = {13, 1 h ? }; /* the king of hearts */ 

complex a[31[3] = { 

{{1.0, -0.1}, {2.0, 0.2}, {3.0, 0.3}}, 

{{4.0, -0.4}, {5.0, 0.5}, {6.0, 0.6}}, 

}; /* a[2][] is assigned zeroes */ 

struct fruit frt = {"plum", 150}; 

struct home_address { 
char *street; 
char *city_and_state; 
long zip_code; 

} address = {"87 West Street", "Aspen, Colorado", 80526}; 

r 

struct home_address previous_address = {0}; 

The last example illustrates a convenient way to initialize all members of a structure to 
have value zero. It causes pointer members to be initialized with the pointer value NULL 
and array members to have their elements initialized to zero. 
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9.6 An Example: Playing Poker 

Let us use the concepts that we have presented thus far in this chapter to write a pro¬ 
gram to play poker. The program will compute the probability that a flush is dealt, 
meaning that all five cards in a hand are of the same suit. In the exercises, we will dis¬ 
cuss ways of extending the program. Caution: We are going to redesign our card struc¬ 
ture. 

Our program consists of many small functions. The simplest scheme is to put them 
all in one file. We will present the functions one after another as they occur in the file. 
Where appropriate, we will discuss key ideas. Here is what occurs at the top of the file: 

In file poker.c 

#include <stdio.h> 

#include <std!ib.h> 

#include <time.h> 

#define NDEALS 3000 /* number of deals */ 

#define NPLAYERS 6 /* number of players */ 

typedef enum {clubs, diamonds, hearts, spades} cdhs; 

struct card { 
int pips; 
cdhs suit; 

}; 

typedef struct card card; 

We are envisioning a game with six players. Because we are interested in the probability 
of a flush occurring, we want to deal many hands. Notice that the member sui t is an 
enumeration type. As we will see, the use of the enumeration type makes the code very 
readable. Next in the file comes the function prototypes: 

card assign__values(int pips, cdhs suit); 
void prn_card_values(card *c_ptr); 
void play_poker(card deck[52]); 
void shuffle(card deck[52]); 

void swap(card *p, card *q); 

void deal_the__cards(card deck[52], card hand[NPLAYERS][5]); 
int i s__flush(card h[5]); 
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We begin our program development with main(). Every time we need a new function, 

we write it at the bottom of the file and put its function prototype in the list just above 
main(). 


int main(void) 

cdhs suit; 
int i, pips; 
card deck[52]; 

for (i = 0; i < 52; ++i) { 
pips = i % 13 + 1; 
if (i < 13) 
suit = clubs; 
else if (i < 26) 
suit = diamonds; 
else if (i < 39) 
suit = hearts; 
el se 

suit = spades; 

deck[i] = assign_values(pips, suit); 

} 

for (i =26; i <39; ++i) /* pr in 

prn_card_values(&deck[i]); 
play_poker(deck); 
return 0; 


/* print out the hearts */ 


In mai n(), we first assign values to each element in the array deck. We think of these 

elements as playing cards in a deck. To check that the code works as expected, we print 
out the hearts. 

card assign_values(int pips, cdhs suit) 
card c; 

c.pips = pips; 
c.suit = suit; 
return c; 


Notice that the identifiers pi ps and suit are parameters in the function definition as 
well as members of the structure c. This sort of dual use of names is common. 
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void prn_card_values(card *c_ptr) 

{ 

int pips = c_ptr -> pips; 
cdhs suit = c_ptr -> suit; 
char *suit_name; 

if (suit == clubs) 

suit_name = ''clubs"; 
else if (suit == diamonds) 
suit_name = "diamonds"; 
else if (suit == hearts) 
suit_name = "hearts"; 
else if (suit == spades) 
suit_name = "spades"; 

printf("card: %2d of %s\n", pips, suit_name); 

} 

In the function prn_card_val ues(), we use pi ps and suit as local variables. The com¬ 
piler cannot confuse these identifiers with the members of card because members of a 
structure can be accessed only via the and -> operators. 

Our next function is play_poker(). Because it is central to this program and con¬ 
tains some crucial ideas, we will look at it in detail. 

void play_poker(card deck[52]) 

{ 

int flush_cnt = 0, hand_cnt = 0; 

» . i * 

int i,3; 

card hand[NPLAYERS][5]; /* each player dealt 5 cards */ 

srand(time(NULL)); /* seed random-number generator */ 

for (i =0; i < NDEALS; ++i) { 
shuffle(deck); 
deal_the_cards(deck, hand); 
for (j = 0; j < NPLAYERS; ++j) { 

++hand_cnt; 

if (is_flush(hand[j])) { 

++flush_cnt; 

printf("%s%d\n%s%d\n%s%f\n\n", 

" Hand number: ", hand_cnt, 

" Flush number: ", flush_cnt, 

"Flush probability: ", 

(double) flush_cnt / hand_cnt); 
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Dissection of the play_poker() Function 


■ card hand[NPLAYERS] [5] ; /* each player dealt 5 cards */ 

The identifier hand is a two-dimensional array. It can be viewed also as an array of 
arrays. Because the symbolic constant NPLAYERS has value 6, w'e can think of hand as 
six hands, each with five cards. 

■ srand(time(NULL)); /* seed random-number generator */ 

The function srandQ is used to seed the random-number generator invoked by 
rand(). The value supplied as an argument to srandO is called the seed. Here, w'e 
have used for the seed the value returned by the function call time (NULL). This func¬ 
tion is in the standard library, and its function prototype is in the header file time.h. 
Because time() produces an integer value that is obtained from the internal clock, each 
time w'e run the program a different seed is produced, causing different numbers to be 
generated by rand (). Hence, every time we run the program different hands are dealt. 

■ for (i = 0; i < NDEALS ; ++i) { 

shuffle(deck); 
deal_the_cards(deck, hand); 

Before dealing the cards, we shuffle the deck. As we will see, this is where we use 
rand () to simulate the shuffling. 

■ for (j = 0; j < NPLAYERS; ++j) { 

++hand_cnt; 

if (is_flush(hand[j])) { 

++flush_cnt; 


After the cards are dealt, we check each player’s hand to see if it is a flush. If it is, then 
we increment f 1 ush_cnt and print. 

1 

The function that shuffles the cards comes next. It is quite simple. We go through the 
deck swapping each card with another card that is chosen randomly. Note that the 52 in 
the header to the function definition is only for the benefit of the human reader. The 
compiler disregards it. 



9.6 T An Example: Playing Poker 423 


void shuffle(card deck[52]) 

{ 

int i, j; 

for (i =0; i <52; ++i) { 
j = randQ % 52; 
swap(&deck[i], &deck[j]) 

} 


void swap(card *p, card *q) 

{ 

card tmp; 
tmp = *p; 

*p = *q; 


tmp 
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Even though the swapO function is used to swap structures, its general form is the 


same. 


void deal_the_cards(card deck[52], card hand[NPLAYERS][5]) 

{ , cx ■ ■ 

int card_cnt = 0, i, 3 I 

for (j =0; j < 5; ++j) 

for (i =0; i < NPLAYERS; ++i) 
hand[i][j] = deck[card_cnt++]; 


Because the compiler disregards the first size in an array in a parameter declaration, an 
equivalent header for this function definition is 

void deal_the_cards(card deck[], card hand[][5]) 

Also because arrays in parameter declarations are really pointers, anothei equivalent 
header is given by 

void deal_the_cards(card *deck, card (*hand)[5]) 

In each of these headers, the 5 is needed by the compiler to generate the correct storage 
mapping function. Notice that in the function the cards are dealt one at a time to each 
of the hands in turn. In contrast, the code 
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for (i = 0 ; i < NPLAYERS; ++i) 
for (j = 0 ; j <5; ++j) 

hand[i][j] = deck[cnt++]; 

would have the effect of dealing five cards all at once to each of the hands. Because we 
are trying to simulate a poker game as it is actually played, the code that we used in 
deal_the_cards() is preferable. 

int is_flush(card h[5]) 

{ 

int i ; 

for (i =1; i <5; ++i) 

if (h[i].suit != h[ 0 ].suit) 
return 0 ; 
return 1 ; 

> 

The function i s_fl ush() checks to see if the cards in a hand are all of the same suit. It 
is invoked in the function play_poker() with the expression is_flush(hand[j]). 
Recall that hand is an array of arrays, making hand[j] itself an array of type card. 
When i s_fi ush Q is called, the elements of this array are accessed. 


9.7 Unions 

A union, like a structure, is a derived type. Unions follow the same syntax as structures 
but have members that share storage. A union type defines a set of alternative values 
that may be stored in a shared portion of memory. The programmer is responsible for 
interpreting the stored values correctly. Consider the declaration 

union int_or_float { 
int i ; 
float f; 

}; 


In this declaration, union is a keyword, int_or_float is the union tag name, and the 
variables i and f are members of the union. This declaration creates the derived data 
type uni on i nt_or_float. The declaration can be thought of as a template; it creates 
the type, but no storage is allocated. The tag name, along with the keyword union, can 
now be used to declare variables of this type. 
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union int_or_float a, b, c; 

This declaration allocates storage for the identifiers a, b, and c. For each variable the 
compiler allocates a piece of storage that can accommodate the largest of the specified 
members. The notation used to access a member of a union is identical to that used to 

access a member of a structure. 

The following example shows how what is stored in a union can be interpreted in 
ferent ways: 


In file numbers.c 

typedef union int_or_float { 
int i; 
float f; 

} number; 

int main(void) 

{ 

number n; 

\f\ *j =3 4444 \ 

printfC'i: %10d f: %16.10e\n”, n.i, n.f); 
n .f = 4444 . 0 ; 

printfC'i- % 10 d f: %16.10e\n , n.i, n.f), 

return 0 ; 

} 

This little experiment demonstrates how your system overlays an i nt and a f 1 oat. The 
output of this program is system-dependent. Here is what is printed on our system: 

i: 4444 f: 6,227370375e-41 

i: 1166729216 f: 4.4440000000e+03 



Dissection of the inLor_float Program 


■ typedef union int„or_float { 
int i; 
float f; 

} number; 

As with structures, the typedef mechanism can be used to abbreviate long type names 
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■ number n; 

I he type number is equivalent to union int_or_f1oat. This declaration causes the 
system to allocate space for the union variable n. That space must accommodate the 
larger of the two members of n. If we assume that an i nt is stored in 2 or 4 bytes and 
that a f 1 oat is stored in 4 bytes, then the system will allocate 4 bytes for n. 

■ n.i = 4444; 

printf("i; %i0d f : %l6.10e\n", n.i, n.f); 

The i nt member n.i is assigned the i nt value 4444. When we print the value of n. i in 
the format of a decimal integer, 4444, of course, is printed. However, the same piece of 
memory is interpreted as a float when we print the member n. f in the %e format. In 
this case, something radically different from 4444 is printed. 

■ n.f = 4444.0; 

printfC’i; %10d f: %16.10e\n'\ n.i, n.f); 

Now we are playing the game the other way. The f 1 oat member n. f is assigned the 
value 4444.0. When n. f is printed in the %e format, we get the correct value, but when 
we print the underlying storage in the %d format, we get something quite different. 

The point is that the system will interpret the same stored values according to which 

member component is selected. It is the programmer’s responsibility to choose the 
right one. 

Unions are used in applications that require multiple interpretations for a given piece 
of memory. But more commonly, they are used to storage by allowing the 

same space in memory to be used for a variety of types. The members of a union can be 
structures or other unions. The following is an example: 

struct flower { 
char 

enum {red, white, blue} 

>; 

struct fruit { 
char *name; 
int calories; 

}; 


*name; 

color; 
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struct vegetable { 
char *name; 
int calories; 

int cooking_time; 

}; 


/* in minutes */ 


union flower_fruit_or_vegetable { 
struct flower flw; 

struct fruit frt; 

struct vegetable veg; 

}; 

union flower_fruit_or_vegetable ffv; 

Having made these declarations, we can use a statement such as 

ffv.veg.cooking_time =7; 

to assign a value to the member cooki ng_time of the member veg in the union ffv. 


.8 Bit Fields 

An i nt or unsigned member of a structure or union can be declared to consist of a 
specified number of bits. Such a member is called a bit field, and the number of associ¬ 
ated bits is called its width. The width is specified by a nonnegative constant integral 
expression following a colon. The width is at most the number of bits in a machine 
word. Typically, bit fields are declared as consecutive members of a structure, and the 
compiler packs them into a minimal number of machine words. An example is 


struct pcard { 
unsigned pips 
unsigned suit 

}; 


/* a packed representation */ 


A variable of type struct pcard has a 4-bit field called pips that is capable of storing 
the 16 values 0 to 15, and a 2-bit field called sui t that is capable of storing the values 
0, 1, 2, and 3, which can be used to represent clubs, diamonds, hearts, and spades, 
respectively. Thus, the thirteen pips values and the four suit values needed to repre¬ 
sent playing cards can be represented compactly with 6 bits. Suppose that we make the 
declaration 
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struct pcard c; 

To assign to c the nine of diamonds, we can write 

c.pips = 9; 
c.suit = 1; 

The syntax for a bit field within a structure or union is given by 

bit_field_member ;:= { int I unsigned } x { identifier } opt 
expr : := constantJntegraLexpression 
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9,9 V An Example: Accessing Bits and Bytes 4zy 


1 tother way to cause alignment to a next word is to use an unnamed bit field with zero 
dth. Consider the code 

'"unsized 1 a : 1, : 0, b : 1, : 0. c : 1; 

}; 

this creates three 1-bit fields in three separate words. 


Whether the compiler assigns the bits in left-to-right or right-to-ieft order is machine- 
dependent. Although some machines assign bit fields across word boundaries, most of' 
them do not. Thus, on a machine with 4-byte words, the declaration 

•%-Pi 

struct abc { ‘fl 

int a : 1, b : 16, c : 16; H 

} x ; H 

typically would cause x to be stored in two words, with a and b stored in the first word 
and c stored in the second. Only nonnegative values can be stored in unsigned bit 
fields. For int bit fields, what happens is system-dependent. On some systems the 
high-order bit in the field is treated as the sign bit. (Fee exercise 25, on page 443.) In 
most applications, unsigned bit fields are used, 4 

The chief reason for using bit fields is to conserve memory. On machines with 4-byte 
words, we can store 32 1-bit variables in a single word. Alternatively, we could use 32 
char variables. The amount of memory saved by using bit fields can be substantial. T 
There are some restrictions, however. Arrays of bit fields are not allowed. Also, the 
address operator & cannot be applied to bit fields. This means that a pointer cannot be 
used to address a bit field directly, although use of the member access operator -> is 
acceptable. 

One last point: Unnamed bit fields can be used for padding and alignment purposes. 
Suppose our machine has 4-byte words, and suppose we want to have a structure that 
contains six 7-bit fields with three of the bit fields in the first word and three in second. 
The following declaration accomplishes this: 
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9.9 An Example: Accessing Bits and Bytes 


In this section we give an example of how the bits and bytes of a word in memory car 
Up accessed One way we can do this is through the use of bitwise operators and masks 
explained hr^Section 7.1, “Bitwise Operators and Expressions,” on page 331 and Sec 
l t ion 7 2, “Masks,” on page 337. Here, we illustrate how it can be done using it le c s 

Our program will be for machines having 4-byte words. 

In file check_bits.c 
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#include <stdio.h> 

typedef struct { 
unsigned b0 : 
} word_bytes; 

typedef struct { 
unsigned 

b0 : 1, bl 
b7 : 1, b8 
bl4: 1, bl5 
b21: 1, b22 
b28: 1, b29 
} word_bits; 


8, bl : 8, b2 : 8, b3 : 8 


1, b2 
1, b9 
1 , bl6 
1, b23 
1, b30 


1, b3 
1, bl0 
1, bl7 
1, b24 
1, b31; 


1, b4 
1, bll 
l, bi8 
1, b25 


1, b5 : 1> b6 * 1 

1, bl2 : 1, bl3 : 1 

1, bl9 : 1, b20 : 1 

1, b26 : 1, b27 : 1 


struct small_integers { 

unsigned il : 7, i2 : 7, i3 : 7, 

: 11 , 

i4 : 7. i 5 : 7. i6 : 7: 
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/* align to next word */ 
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typedef union { 
int 1 > 

word_bits bit; 

word_bytes byte; 
1 word: 
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Programmer-defined types are frequently implemented with structures. In this sec¬ 
tion, we develop and implement the ADT stack, one of the most useful standard data 
structures. A stack is a data structure that allows insertion and deletion of data to 
occur only at a single restricted element, the top of the stack. This is the last-in-first-out 
(LIFO) discipline. Conceptually, a stack behaves like a pile of trays that pops up or is 
pushed down when trays are removed or added. The typical operations that can be 
used with a stack are push, pop, top, empty>, full, and reset. The push operator places a 
value on the stack. The pop operator retrieves and deletes a value off the stack. The top 
operator returns the top value from the stack. The empty operator tests if th*e stack is 
empty. The full operator tests if the stack is full. The reset operator clears the stack, or 
initializes it. The stack, along with these operations, is a typical ADT. 

We will use a fixed-length char array to store the contents of the stack. (Other imple¬ 
mentation choices are possible; in Section 10.5, “Stacks,” on page 461, we will imple¬ 
ment a stack as a linked list.) The top of the stack will be an integer-valued member 
named top. The various stack operations will be implemented as functions, each of 
whose parameter lists includes a parameter of type pointer to stack. By using a 
pointer, we avoid copying a potentially large stack to perform a simple operation. 


In file stack.c 


/* An implementation of type stack. */ 


#define 
#define 
#define 


MAX„LEN 

EMPTY 

FULL 


1000 

-1 

(MAX_LEN 


typedef enum boolean {false, true} boolean; 

typedef struct stack { 
char s[MAX_LEN]; 
int top; 

} stack; 

void reset(stack *stk) 

{ 

stk -> top = EMPTY; 

} 

void push(char c, stack *stk) 

{ 

stk -> top++; 

stk -> s[stk -> top] = c; 
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char pop(stack *stk) 

{ 

return (stk -> s[stk -> top—]); 

} 

char topCconst stack *stk) 

{ 

return (stk -> s[stk -> top]); 

} 

boolean empty(const stack *stk) 

{ 

return ((boolean) (stk -> top == 

} 
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boolean full(const stack *stk) 

return ((boolean) (stk -> top == FULL)); 
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Dissection of the stack Implementation 

■ typedef enum boolean {false, true} boolean; 

We use this typedef to give a new name, boolean, to the enumeration type, enum 

boolean. Note that the new name coincides with the tag name. Programmers often do 
this. 

■ typedef struct stack { 

char s[MAX_LEN]; 
int top; 

} stack; 

This code declares the structure type struct stack, and at the same time uses a 

typedef to give struct stack the new name stack. Here is an equivalent way to write 
this: 

struct stack { 

char s[MAX_LEN]; 
int top; 

}; 

typedef struct stack stack: 
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The structure has two members, the array member s and the i nt member top. 

void reset(stacl< *stk) 

{ 

stk -> top = EMPTY; 

} 

The member top in the stack pointed to by stk is assigned the value EMPTY. This con¬ 
ceptually resets the stack, making it empty. In the calling environment, if st is a stack, 
we can write 

reset(&st); 

to reset st or initialize it. At the beginning of a program, we usually start with an empty 
stack. 

void push(char c, stack *stk) 

{ 

stk -> top++; 
stk -> s[stk -> top] = c; 

} 

char pop(stack* stk) 

{ 

return (stk -> s[stk -> top—]); 

} 

The operation push is implemented as a function of two arguments. First, the member 
top is incremented. Note that 

stk -> top++ is equivalent to (stk -> top)++ 

Then the value of c is shoved onto the top of the stack. This function assumes that the 
stack is not full. The operation pop is implemented in like fashion. It assumes the stack 
is not empty. The value of the expression 

4 -* I jt ia 

StK -> top— 

is the value that is currently stored in the member top. Suppose this value is 7. Then 
stk -> s[7] 

gets returned, and the stored value of top in memory gets decremented, making its 
value 6. 
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boolean empty(const stack *stk) 

{ 

return ((boolean) (stk -> top 

} 

boolean full(const stack *stk) 
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EMPTY)); 


FULL)); 


Each of these functions tests the stack member top for an appropriate condition and 
returns a value of type boolean, either true or false. Suppose the expression 



j ?i 
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stk -> top 


EMPTY 


in the body of empty () is true. Then the expression has the i nt value 1, This value get 
cast to the type boolean, making it true, and that is what gets returned. 




1 o test our stack implementation, we can put the preceding code in a .c file and add 
the following code at the bottom. Our function mai n() enters the characters of a strinj 
onto a stack and then pops them, printing each character out in turn. The effect is t< 
print in reverse order the characters that were pushed onto the stack. 

In file stack.c 

/" Test the stack implementation by reversing a string. */ 

#include <stdio.h> 




int main(void) 

{ 

char str[] = "My name is Laura Pohl!"; 

int i; 

C 'f* 3 1/ c* it 

Lcldk b , 

reset(&s); /* initialize the stack */ 

pnntf(" In the string: %s\n", str); 
for (i = 0; str[iJ != *\0•; ++i) 
if (!ful1(&s)) 


tpf : 
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push(str[i], &s); 
printf("From the stack: ") 
while (!empty(&s)) 
putchar(pop(&s)); 
putchar('\n'); 
return 0; 


/* push a char on the stack */ 
/* pop a char off the stack */ 



The output from this test program is 

S' ' 1 

In the string: My name is Laura Pohl! 

From the stack; IlhoP aruaL si eman yM 

Note that the expression &s, the address of the stack variable s, is used as an argu¬ 
ment whenever we call a stack function. Because each of these functions expects a 
pointer of type stack *, the expression &s is appropriate. 



Summary 

1 Structures and unions are principal methods by which the programmer can define 
new types. 

2 The typedef facility can be used to give new names to types. Programmers rou¬ 
tinely use typedef to give a new name to a structure or union type. 

3 A structure is an aggregation of components that can be treated as a single variable. 
The components of the structure are called members. 

4 Members of structures can be accessed by using the member access operator If 
s is a structure variable with a member named m, then the expression s .m refers to 
the value of the member m within the structure s. 

5 Members of structures can also be accessed by the member access operator ->. If p 
is a pointer that has been assigned the value &s, then the expression p -> m also 
refers to s.m. Both and -> have highest precedence among C operators. 

6 In ANSI C, if a and b are two variables of the same structure type, then the assign¬ 
ment expression a = b is valid. It causes each member of a to be assigned the value 
of the corresponding member of b. Also, a structure expression can be passed as an 
argument to a function and returned from a function. (Many traditional C compilers 
also have these capabilities, but not all of them.) 

7 When a structure variable is passed as an argument to a function, it is passed “call- 
by-value.” If the structure has many members, or members that are large arrays, 
this may be an inefficient way of getting the job clone. If we redesign the function 
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definition so that a pointer to the structure instead of the structure itself is 
then a local copy of the structure will not be created. 

8 L7r!° n iS like 3 structure - exce P* that the members of a union share the sa»3 
P i f 11 memory ' Unions are used principally to conserve memory. The space a iW 

members of^uihon h^i °th ^ ^ ^ * V3riety of types ’ s P ecitied by the 

• 1 on ‘ 111S the Programmer s responsibility to know which renr^ 

sentation is currently stored in a union variable. P e ' 
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9 The members of structures and unions can be arrays or other struct,,™ a J • 
unions. Considerable complexity is possible when nesting arrays, structures 

ST" each othcr - Care must be takenthat the A« #g§ 
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>picaJly as contiguous bits in a machine word, provided that they fit. f 

11 men^rmrnose^ 6 ,^. nnamed ’ m which case they are used for padding or word align- 

ment purposes. The unnamed bit field of width 0 is special. It causes immediate 
alignment on the next word. immediate 

12 nnTit'f 18 ff/^mented is system-dependent. Hence, their use need not be 
P le. Nonetheless, bit fields have important uses. 

13 The stack is an abstract data type (ADT) that has many uses, especially in computer ' 

science. An ADT can be implemented many different ways. computer 
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In some situations a typedef can be replaced by a #def i ne. Here is an example: 

typedef float DOLLARS; 

int main(void) 

{ 

DOLLARS amount = 100.0, interest = 0.07 * amount; 


printf("DOLLARS 
return 0; 


= %.2f\n", amount + interest); 


Execute this program so you understand its effects, then replace the typedef by 
#define DOLLARS float 

When you recompile the program and execute it, does it behave as it did before? 

In some situations a typedef cannot be replaced by a #define. Consider the fol¬ 
lowing program; 

typedef float DOLLARS; 

int main(void) 

{ 

DOLLARS amount = 100.00, 

interest =0.07 * amount; 

{ /* new block */ 

float DOLLARS; 

DOLLARS = amount + interest; 

printf("DOLLARS = %.2f\n", DOLLARS); 

} 

return 0; 
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Execute this program so that you understand its effects. If the typedef is replaced 
by the line 


■s 


#define DOLLARS float 
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3 The program in the previous exercise used a new block to illustrate a certain point. 
Because of this, the code was not very natural. In this exercise, we use a typedef 
again, and this time the code is quite straightforward. 

typedef char * string; 
int main(void) 

{ ' 1 

string a[] = {"I", "like", "to", "fight,"}, 
b[] = {"pinch,", "and", "bite."}; 

printf("%s %s %s %s %s %s %s\n", i 

a[0], a[l], a[2], a[3], b[0], b[l], b[2]); 
return 0; 

} 

Execute the program so that you understand its effects. If the typedef is replaced 
by the line 

#define string char * 

then the program will not compile. Explain why the typedef works, but the 
#defi ne does not. To make the program work with the #def i ne, a single character 
needs to be added. What is the character? 

4 Write a function add (a, b, c), where the variables a, b, and c are of type matrix. 
Write a program to test your function. 

5 Speed is not everything, but it counts for a lot in the computing world. If a structure 
is large, it is more efficient to pass an address of the structure than it is to pass the 
structure itself. Does this hold true if the structure is small? After all, there is a cost 
for dealing with pointers, too. Suppose you have many complex numbers to multi¬ 
ply. How should you design your multiplication function? You could pass struc¬ 
tures as arguments: 

complex mult(complex b, complex c) 

{ 

complex a; 

a . re = b . re * c . re - b . i m * c . i m; 
a . i m = b . i m * c. re + b. re * c. i m; 
return a; 
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But it may be faster to pass addresses: 

void mult(complex *a, complex *b, complex *c) 

{ 

* * * tt « 

Complete this version of the mult() function, and write a test program to see 
which version is faster. 

6 The program in this exercise uses a typedef to name the type “pointer to a func¬ 
tion that takes a double and returns a double.” First, run the program so that you 
understand its effects. 

#include <stdio.h> 

#include <math.h> 

fdefine PI 3.14159 

typedef double dbl; /* create an abbreviation */ 

typedef dbl (*PFDD)(dbl); /* ptr to fct taking dbl 

and returning a dbl */ 

int main(void) 

{ 

PFDD f = sin, g = cos; 

printf("f(%f) = %f\n", PI, f(PI)); 
printf(”g(%f) = %f\n", PI, g(PI)); 
return 0; 

} 

Now modify the code so that the second typedef is replaced by 

typedef dbl FDD(dbl); /* fct of dbl returning a dbl */ 

typedef FDD *PFDD; /* ptr to FDD */ 

Does the program still run? (It should.) Carefully explain how the typedef for PFDD 


7 Write a function to do complex subtraction. Write two versions: one that returns a 
pointer to compl ex and one that returns a value of type comp! ex. 

8 Write a small collection of functions that perform operations on complex vectors 
and matrices. Include in your collection functions that print complex vectors and 
matrices on the screen. Write a program to test your functions. 
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9 Write a program that is able to produce a balanced meal. Start by defining a stru 
ture that contains the name of a food, its calories per serving, its food type such a 
meat or fruit, and its costs. The foods should be stored as an array of structures 
The program should construct a meal that comes from four different food types 
and that meets calorie and cost constraints. The program should be capable of pro 
during a large number of different menus. 

10 Create a structure that can describe a restaurant. It should have members that 
include the name, address, average cost, and type of food. Write a routine that 
prints out all restaurants of a given food type in order of cost, with the least costly 
first. 

11 Put a list of student names, identification numbers, and grades into a file called 
data. For example, the beginning of the file might look like ' 
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Write a program called reorder that can be used to read the data in the file and put 
it into the array cl ass of type struct student. This can be done with redirection 

reorder < data 

The program should print out an ordered list of students and grades. Students with 
A grades should be listed first, students with B grades next, and so forth. Among all 
students having the same grade, the students should be listed alphabetically by 
name. 

12 Modify the program that you wrote in the previous exercise by adding the function 
class_averageO to compute the class average and print it. Assume that an A 
grade has value 4, a B grade has value 3, and so forth. 

13 Experiment with the poker program. After dealing many hands, the last computed 
probability of getting a flush should approximate the mathematical probability. 
What is the mathematical probability of getting a flush? If you arc dealt one card, 
then it will be of a certain suit, say hearts. What is the chance that the second card 
you are dealt will be a heart? Well, there are 31 cards left in the deck and 12 of them 
are hearts, so the probability is 12/51. What is the chance that the third card you 
are dealt will also be a heart? Because there are now 7 50 cards left in the deck and 11 
of them are hearts, the probability that the second and third cards are hearts is 
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(12 / bl) x (11 / 50). Continuing in this fashion, we see that the mathematical prob¬ 
ability of getting a flush is 

12 11 10 9 

51 X 50 X 49 X 48 

This product is approximately 0.00198. (Use your hand calculator to check that this 
is so.) When your machine plays poker, some of the computed probabilities should 
be larger than this number and some smaller. Is that the case? 

14 Modify the poker program by adding the function i s_strai ght(), which tests 
whether a poker hand is a straight. Whenever a straight is dealt, print the computed 
probability. Because a flush beats a’ straight, we expect the probability of a flush to 
be lower than the probability of a straight . Does your poker program confirm this? 

15 Modify the poker program by adding the function i s_ful 1 house (), which tests 
whether a poker hand is a full house. Whenever a full house is dealt, print the com¬ 
puted probability. Because a full house beats a flush, we expect the probability of a 
full house to be lower than that of a flush. Does your poker program confirm this? 

16 The experienced poker player arranges the hand to reflect its values. Write a pro¬ 
gram that arranges and prints out a hand of five cards in sorted order by pips 
value. Assume that an ace is highest in value, a Icing is next highest in value, and so 

17 (Advanced) Write a function hand_val ue(), which returns the poker value of the 
hand. The best possible hand is a straight flush, and the worst possible hand is no 
pair. Write another function that compares two hands to see which is best. 

18 Consider the following version of the function assign_values(): 

void assign_values(card c, int pips, cdhs suit) 

{ 

c.pips = pips; 
c.suit = suit; 

} 

When we test this function with the following code we find that it does not work as 
expected. Explain. 

card c; 

assign_values(c, 13, diamonds); /* the king of diamonds */ 
prn_card_values(&c); 
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19 Consider the following union declaration: I 

union a { 1 

int a; % 

char b; % 

float a; ,*j 

} a, b, c; 

There is only one thing wrong. What is it? 

20 Write a typedef for a structure that has two members. One member should be 1 

union of double and complex, and the other member should be a “flag” that tells 
which domain is being used. Write functions that can add and multiply over both 
domains. Your functions should decide on appropriate conversions when mixing 
arguments from both domains. ' 

21 In commercial applications, it is sometimes useful to use binary coded decimal 
codes (BCD), where 4 bits are used to represent a decimal digit. A 32-bit word can 
be used to represent an 8-digit decimal number. Use bit fields to implement this 
code. Write two conversion routines, one from binary to BCD and the second from 
BCD to binary. 


22 In Section 4.10, “An Example: Boolean Variables,” on page 170, we wrote a program 
that printed a table of values for some boolean functions. Rewrite that program 
using bit fields. Represent each boolean variable as a 1 -bit field. 


(Sieve of Eratosthenes) Use an array of structures, where each structure contains a 
word that is divided into bit fields, to implement the sieve algorithm for primes. On 
a machine with 4-byte words, we can use an array of 1,000 elements to find all 
primes less than 32,000. Let each bit represent an integer, and initialize all bits to 
zero. The idea is to cross out all the multiples of a given prime. A bit that is 1 will 
represent a comuosite number. Start with the prime 2. Every second bit is made 1 
starting wun dii 4 . men mt .i is son u. every third bit is made 1, starting with bit 6. 
Bit 4 is 1, so it is skipped. Bit 5 is the next bit with value 0. Every fifth bit is made 1, 
starting with bit 10. At the end of this process, only those bits that are still zero 
represent primes. 
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24 What gets printed by the following program? Explain. 

#include <stdio.h> 
struct test { 

unsigned a : 1, b : 2, c : 3; 

}; 

int main(void) 

{ 

i nt i; 

struct test x; 

for (i =0; i <23; ++i) { 
x.a = x.b = x.c = i; 

printfCx.a = %d x.b = %d x.c = %d\n", x.a, x.b, x.c); 

} 

return 0; 

} 

What happens if you replace 


x.a = x.b = x.c = i; by x.c = x.b --- x.a = i; 

25 Does your system implement signed arithmetic for i nt bit fields? Try the following 
code: 


struct test { 

int a : 3, b : 4; 

} x = {0}; 

for ( ; ; ) 

printf("x.a = %2d x.b = %2d\n", x.a++, x.b++); 
26 What gets printed? Explain. 

typedef struct a { unsigned a : 1, : 0, b : 1; } a; 
typedef struct b { unsigned a : 1, b : 1; } b; 

printf("%.lf\n", (float) sizeof(a) / sizeof(b)); 
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27 In this exercise, we want to consider = and == with respect to their use with strufl* 
tures. Suppose a and b are two structure variables of the same type. Then tfffjg 
expression a * bis valid, but the expression a == b is not. The operands of the =jjjp 
operator cannot be structures. (Beginning programmers often overlook this subt|j|§§ 
point.) Write a small test program to see what your compiler does if you try to useljjj 
the expression a == b, where a and b are structures of the same type. 
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28 (Project) The double-ended queue is a very useful ADT. To implement this ADlf|§i 
start with the following constructs: IH 






struct deque { 


/* double ended queue */ 




typedef struct deque 
typedef enum {false, true} 
typedef data 


deque; 

boolean 

char; 


Jg|§ 



The programmer-defined type deque implements a double-ended queue with anjl 
array. If you want the data stored in the queue to be i nts, use ill 



typedef data 


int; 


Function prototypes of useful operations include 


void 

void 

data 

data 

boolean 

boolean 


add_to_front(deque *, data); 
add_to_rear(deque *, data); 
take_from_front(deque *); 
take_from_rear(deque *); 
empty(deque *); 
full(deque *); 


Write and test an implementation of the double-ended queue .ADT. (In Section 10.7, 
“Queues,” on page 471, we will see how to do this with a linked list.) 
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Although the program in this exercise is system-dependent, it will run correctly on 
many different systems. See if it runs correctly on your machine. 

/* The mystery program. */ 

#i f (VAX | j PC) 

#define HEX0 0x6c6c6548 

#define HEX1 0x77202c6f 

#define HEX2 0x646c726f 

#define HEX3 0x00000a21 

#el se 

#define HEX0 0x48656c6c 

#define HEX1 0x6f2c2077 

fdefine HEX2 0x6f726c64 

fdefine HEX3 0x210a0000 

#endif 

typedef union { 
char what[16]; 
long cipher[4]; 

} mystery; 

int main(void) 

{ 

mystery x; 

x.cipher[0] = HEX0; /* put a hex on the mystery */ 

x.cipher[X] = HEX1; 

x.cipher[2] = HEX2; 

x.cipher[3] = HEX3; 

printf("%s", x.what); 

return 0; 

} 

Explain how the mystery program works. If the phrase that is printed is unfamiliar 
to you, ask a programmer who has worked in C for a number of years what its sig¬ 
nificance is. 
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Structures and List Processing 



In this chapter, we explain self-referential structures. We define structures with pointer 
members that refer to the structure containing them. Such data structures are called 
dynamic data structures. Unlike arrays or simple variables that are normally allocated 
at block entry, dynamic data structures often require storage management routines to 
explicitly obtain and release memory. 




10.1 Self-referential Structures 








Am 




Let us define a structure with a member field that points at the same structure type. We 
wish to do this in order to have an unspecified number of such structures nested 
together. 


struct list { 
i nt 

struct list 
} a; 


data; 

*next; 


This declaration can be stored in two words of memory. The first word stores the mem¬ 
ber data, and the second word stores the member next. The pointer variable next is 
called a link. Each structure is linked to a succeeding structure by way of the member 
next. These structures are conveniently displayed pictorially with links shown as 


arrow's. 
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data next 

The pointer variable next contains either an address of the location in memory of 
the successor list element, or the special value NULL defined as 0. NULL is used to 
denote the end of the list. Let us see how all this works by manipulating 

struct list a, b, c; * 

We will perform some assignments on these structures: 


a. data 

b. data 

c. data 
a.next 


£ 

2 ; 

3; 

b.next 


c.next 


NULL 


Pictorially, the result of this code is the following: 

After assignment 


NULL 


NULL 


NULL 


Let us chain these together. 

a. next = &b; 

b. next = &c; 

These pointer assignments result in linking a to b to c. 

After chaining 





NULL 
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Now the links allow us to retrieve data from successive elements. Thus, 

a.next -> data 
has a value 2 and 

a. next -> next -> data 
has value 3. 


10.2 Linear Linked Lists 

A linear linked list is like a clothes line on which the data structures hang sequentially. 
A head pointer addresses the first element of the list, and each element points at a suc¬ 
cessor element, with the last element having a link value NULL. This discussion will use 
the following header file: 

In tile list.h 

#include <stdio.h> 

#include <stdlib.h> 

typedef char DATA; /* will use char in examples */ 

struct linked^l i st { 

DATA d; 

struct 1inked_list *next; 

}; 

typedef struct linked_list ELEMENT; 

typedef ELEMENT *LINK; 
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Storage Allocation 

lhe specifications in list.h do not allocate storage. The system can allocate storage if y 
declare variables and arrays of type ELEMENT. But what makes these structures espe 
dally useful is that utility functions exist to allocate storage dynamically. The system 

provides malloc() in the standard library, and its prototype is in stdlib.h. A function 
call of the form 

mal 1 oc (size) 

returns a pointer to enough storage for an object of size bytes. The storage is not initial 
ized. The argument to mallocO is of type size_t, and the value returned is of type 
pointer to voi d. Recall that si ze_t is the unsigned integer type that is defined in both 
stddef.h and stdlib.h. It is the type that results when the si zeof operator is used. If 
head is a variable of type LINK, then 

head = mal1oc(sizeof(ELEMENT)); 

obtains a piece of memory from the system adequate to store an ELEMENT and assign 
its address to the pointer head. As in the example, mallocO is used with the si zeof 
operator. A cast is unnecessary because mal 1 oc () returns a pointer to voi d, which can 
be assigned to a variable of any pointer type. The si zeof operator calculates the 
required number of bytes for the particular data structure. 

In the code that follows, we will dynamically create a linear linked list storing the 
three characters n, e, and w. The code 
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head 

head 

head 


- mal1oc(sizeof(ELEMENT)) 
~> d = r n’; 

-> next = NULL; 


f: 


m - <mmm 


creates a single-element list. 


Creating a linked list 


NULL 
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A second element is added by the assignments 






head 

head 

head 


> next = malloc(sizeof(ELEMENT)); 
•> next -> d = ' e'; 

-> next -> next = NULL: 
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It is now a two-element list. 
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A two-element linked list 


LMl 


Finally, we add a last element: 

head -> next -> next = malloc(sizeof(ELEMENT)); 
head -> next -> next -> d = 'w 1 ; 
head -> next -> next -> next = NULL; 

We have a three-element list pointed to by head, and the list ends when next has the 
sentinel value NULL. 


A three-element linked list 





NULL 


10.3 List Operations 


The following list includes some of the basic linear list operations: 


Linear Li: 


1 Creating a list 

2 Counting the elements 

3 Looking up an element 

4 Concatenating two lists 

5 Inserting an element 

6 Deleting an element 

We will demonstrate the techniques for programming such operations on lists using 
both recursion and iteration. The use of recursive functions is natural because lists are 
a recursively defined construct. Each routine will require the specifications in file list.h. 
Observe that d in these examples could be redefined as an arbitrarily complicated data 
structure. 
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As a first example, we will write a function that will produce a list from a string. The 
function will return a pointer to the head of the resulting list. The heart of the functiojj 
creates a list element by allocating storage and assigning member values. 

/* List creation using recursion. */ 

#include <stdlib.h> 

#include "list.h" 

LINK string_to_list(char s[]) 



-a 

gas£2 fan: 

M |S#«# 


LINK head 





if Cs[0J == '\0') /* base case */ 

return NULL; 
el se { 

bead = mal1oc(sizeof(ELEMENT)); 
head -> d = s[0]; 

head -> next = string_to_list(s + 1) 
return head; 


P ■ 

pfef:? ; 1111 
. w . - ' 5 

(m 




Notice once more how recursion has a base case (the creation of the empty list) and a 
general case (the creation of the remainder of the list). The general recursive call 5 
returns as its value a LINK pointer to the remaining sublist 
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Dissection of the string_to_list() Function 


■ LINK string_to_list(char s[]) 

{ 

LINK head; 

l^hen a string is passed as an argument, a linked list of the characters in the string is 
created. Because a pointer to the head of the list will be returned, the type specifier in 
the header to this function definition is LINK. 




‘v'V-‘"Vv 

mm «iS 

3mm 




WlM' Wmk 

m ■: • • 


; - 




tiiP SIP 

HHs 
iK mm 




if (s[0] == ’\0’) 
return (NULL); 


/* base case */ 



When the end-of-string sentinel is detected, NULL is returned, and, as we will see, the 
recursion terminates. The value NULL is used to mark the end of the linked list. 
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else { 
head 


malloc(sizeof(ELEMENT)) 


If the string s [] is not the null string, then mal 1 oc() is used to retrieve enough bytes 
| to store an object of type ELEMENT, Because mallocO returns a pointer to void, it can 
be assigned to the variable head, which is a different pointer type. A cast is unneces- 
| sary. The pointer variable head now points at the block of storage provided by mal - 
loc(). 

■ head -> d = s[0]; 

The member d of the allocated ELEMENT is assigned the first character in the string s []. 

■ head -> next = string_to_Jist(s + 1); 

The pointer expression s + 1 points to the remainder of the string. The function is 
called recursively with s + 1 as an argument. The pointer member next is assigned the 
pointer value that is returned by stri ng_to_l i st(s + 1). This recursive call returns 
as its value a LINK or, equivalently, a pointer to ELEMENT that points to the remaining 
sublist. 

■ return head; 


The function exits with the address of the head of the list. 


This function can also be written as an iterative routine with the help of the addi¬ 
tional auxiliary pointer tail. We will name the iterative version s_to_l () to distin¬ 
guish it from the recursive version stri ng_to_l i st(). 

In file iter Jist.c 


/* List creation using iteration 

#include <stdlib.h> 

#include "list.h" 

LINK s_to_l(char s[]) 

{ 

LINK head = NULL, tail; 

int i; 
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if (s[0] != '\0') { /* first element */ 

head = mallocCsizeof(ELEMENT)); 
head -> d = s[0]; 
tail = head; 

for (i =1; s [i ] != '\0’; ++i) { /* add to tail */ 

tail ~> next = malloc(sizeof(ELEMENT)); 
tail = tail -> next; 
tail -> d = s[i]; 

} 

tail -> next = NULL; /* end of list */ 

} 

return head; 

} 

Functions operating on lists often require local pointer variables such as head and 
tai 1. One should use such auxiliary pointers freely to simplify code. It is also impor¬ 
tant to hand-simulate these routines. It is useful to try your program on the empty list, 
the unit or single-element list, and the two-element list. Frequently, the empty list and 
the unit list are special cases. 

Passing a null string to s_to_l () creates the empty list by having the routine return 
with value NULL. Creating the first element is done by the first part of the code. The 
one-element list created from the string "A" is shown in the following diagram. This is 
the state of the computation before the member next is assigned the value NULL. 

A one-element list 

head 
tail 

For the two-element case, say "AB", list creation is as pictured. First, the one-element 
list containing 'A' is created. The for statement is then executed, with i having value 1 
and s [1] having value ' B'. A new element is then allocated and attached to the list. 

A second element is attached 

head 
tail 

The statement tai 1 = tai 1 -> next; advances tai 1 to the new element. Then its d 
member is assigned ' B 1 . 
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Updating the tail 


head 



tail 


Now s [2] has value \0 and the for statement is exited with a two-element list. Finally, 
the end of the list is marked with a NULL, 


After assigning NULL 


head 



tail 


jpaappi; 


The undefined member values occur because malloc() is not required to initialize 
memory to zero. 
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10.4 Some List Processing Functions 

We will write two additional recursive functions. The first coimts the elements in a list, 
and the second prints the elements of a list. Both involve recurring down the list and 
terminating when the NULL pointer is found. All these functions use the header file 
list.h. 

The function count () returns 0 if the list is empty; otherwise, it returns the number 
of elements in the list. 

/* Count a list recursively. */ 

int count(LINK head) 

{ 

if (head == NULL) 
return 0; 

p"| cp 

return (1 + count(head -> next)); 

} 
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An iterative version of this function replaces the recursion with a for loop. 
/ Count a list iteratively* */ 


jnt countJtCUNK head) 


nit 


cnt 


0 


for 4 c ’ t heacl ! " NULL > >>«« - head -> next) 


) 


return cnt; 


Keep in mind that head is passed “rall-hv v/aino »„„ ». • 1 • 

destroy access the list "« 


e ioutine print_list() recursively marches down a list- 
member variable d; s Q0Wn a llst 


printing the value of 


/ w Print a list recursively. */ 


void print_list(LINK head) 


{ 


if (head == NULL) 
printf("NULL"); 
else { 

printf("%c —> ", head -> d); 
pnnt_list(head -> next); 


} 


> 


To illustrate the use of these functions we will writs a , . ... 

string "ABC" to a list and print if ’ wr te a proRram that WJ U convert the 


In file prn_list.c 


^include <stdio.h> 

^include <stdlib.h> 

#include "list.h" 

LINK string_to_list(char []); 

vend print_list(LINK); 

Tnt count(LINK); 
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int main (void) 

{ 

LINK h; 

h = string_to_list("ABC"); 
printf("The resulting list is\n"); 
print_list(h); 

printf("\nThis list has %d elements.\n", count(h)); 
return 0; 


The program produces the following output: 

The resulting list is 
A —> B —> C —> NULL 
This list has 3 elements. 

Often one washes to take two lists and return a single combined list. The concatena¬ 
tion of lists a and b, where a is assumed to be nonempty, will be the list b added to the 
end of list a. A function to concatenate will march down list a looking for its end, as 
marked by the null pointer. It will keep track of its last non-null pointer and will attach 
the b list to the next link in this last element of list a. 

/* Concatenate list a and b with a as head. */ 

void concatenate(LINK a, LINK b) 

{ 

assert(a != NULL); 
if (a -> next == NULL) 
a -> next = b; 
else 

concatenate^ -> next, b); 

} 

Recursion allows us to avoid using any auxiliary pointers to march down the a list. In 
general, the self-referential character of list processing makes recursion natural to use. 
The form of these recursive functions is as follows: 


void generic_recursion(LINK head) 

{ 

if (head == NULL) 
do the base case 

d1 C0 

C I jC 

do the general case and recur with 
generic_recursion(head -> next) 
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Insertion 

onceTh^p^monln’thX^f S ° fiS thal insertion takes a fixed amount of time 
large arraf SaMn/ah 18 nd * Contra8t ’ if one wished t0 Phce a value in a 

would take ? array Valu T in the same 8e Quential order, the insertion 

elements of the ar v th, ” t0 ^ length of the arra ^ The values of all 

over one element. Y * dfter the neWly mserted vaIue wouid have to be moved 

and^^an!^ mSe h Ti0n3 Ust by having two adjacent elements pointed at by pi 
and P 2, and msertmg between them an element pointed at by q. ' P 


Before insertion 



Ihe following function i nsert() places the element nnmtpd at m,- „ i, , 

ments pointed at by pi and p2: element pointed at by q between the ele 

/" Inserting an element in a linked list. */ 
void insert(LINK pi, LINK p2, LINK q) 

assertfpl -> next == p2); 

a 1 r n !vf = 5’ /* 'insert */ 

q -> next = p2; 

} 


After insertion 
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Deletion 


Deleting an element is very simple in a linked linear list. The predecessor of the ele¬ 
ment to be deleted has its link member assigned the address of the successor to the 
deleted element. Again, let us first illustrate the delete operation graphically. Here is 
the picture that we start with: 


Before deletion 



A 


B 


Now we execute the following code: 


p -> next = p -> next -> next 


After deletion 






As the diagram shows, the element containing ’B’ is no longer accessible and is of no 
use. Such an inaccessible element is called garbage. Because memory is frequently a 
critical resource, it is desirable that this storage be returned to the system for later use. 
This may be done with the stdlib.h function freeQ. When called as free(p), previ¬ 
ously allocated storage for the object pointed to by p is available to the system. The for¬ 
mal argument to free() is pointer to void. 

Using f ree(), we can write a deletion routine that returns allocated list storage to 
the system. 


/* Recursive deletion of a list. */ 


void delete_list(LINK head) 

{ 

if (head != NULL) { 

delete_list(head -> next); 
free(head); 


/* release storage */ 
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Because free() takes a single argument of type void *, we can pass a pointer of any 
type to free(). We clo not have to use a cast because void * is the generic pointer 
type. 


10.5 Stacks 


We presented the abstract data type stack as an array in Section 9.10, “The ADT Stack,” 
on page 430. Here, we will reimplement the ADT stack with a linear linked list. A stack 
has access restricted to the head of the list, which will be called its top. Furthermore, 
insertion and deletion occur only at the top, and under these restrictions the operations 
are known as push and pop, respectively. 

A stack can be visualized as a pile of trays. A tray is always picked up from the top 
and a tray is always returned to the top. Graphically, stacks are drawn vertically. 


A stack implementation 
stack elem 



T 

data 

NULL 


10.5 ▼ Stacks 461 


* 










mf" 




§mi 





IP 


Y ; 




p /- 1 ; h r' - / / - . r ’ i 

L.^.- Am je&mm 




mm ■' ■ f j m 

L V' 

Ifc - # 

ig»«sia 




|J We w ni write a stack program that consists of a .h file and two x files. Here is the 
H header file: 


In file stack.h 
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1 i n ked 1 i st i tnpl emenbalii on of* a shack * / 

#include <stdio.h> 

#include <stdlib.h> 


#define 

#define 

typedef 

typedef 


EMPTY 

FULL 


10000 


char 

enum {false, true} 


struct elem { 
data 

struct elem 

}; 


data; 
boolean; 

/* an element on the stack */ 


next 


typedef struct elem elem, 

struct stack { 
int cnt; 
elem *top; 


typedef struct stack stack 


/* a count of the elements */ 
/* ptr to the top element */ 


void 

void 

data 

data 

boolean 

boolean 


initialize(stack *stk); 
push(data d, stack *stk); 
pop(stack *stk); 
top(stack *stk); 
emptyCconst stack ,( stk); 
full(const stack *stk); 


The function prototypes of the six standard stack operations that w e are gomg to 
imDlement are listed at the bottom of our header file. Conceptually, these functions 

behave like those we presented in Section 9.10, “The ADT Stack ” on 

use of the type data makes our code reusable. (See Section 10.6, An Example. Polish 

Notation and Stack Evaluation,” on page 464, where we use the code again.) 
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In file stack.c 


/" The basic stack routines. */ 

#include "stack.h" 

void i„itialize(stack •«« 

stk ~> cnt = 0; 
stk -> top = NULL; 

} 

void push(data d, stack *stk) 
elem *p; 

p = malloc(sizeof(elem)); 
p -> d = d; 

p -> next = stk -> top; 
stk -> top = p; 

stk -> cnt++; 

** ' 


data pop(stack *stk) 

data d; 
elem *p; 

d = stk -> top -> d; 

p = stk -> top; 

stk -> top = stk -> top 

stk -> cnt—; 

free(p); 

return d; 


-> next 


data top(stack *stk) 

{ 

return (stk -> top -> d) 


boolean empty(const stack *stk) 
return ((boolean) (stk -> cnt 


== EMPTY)); 
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boolean full(const stack *stk) 

{ 

return ((boolean) (stk -> cnt 

} 


FULL)); 


The pushQ routine uses the storage allocator mallocO to create a new stack element, 
and the pop() routine returns the freed-up storage back to the system. 

A stack is a last-in-first-out (LIFO) data structure. The last item to be pushed onto the 
stack is the first to be popped off. So if we were to push first ' a 1 and second ’ b 1 onto 
a stack, then pop() would first pop ’ b'. In mai n(), we use this property to print a 
string in reverse order. This serves as a test of our implementation of the ADT stack. 

In file main.c 

/* Test the stack implementation by reversing a string. */ 

#include "stack.h" 

int main(void) 

{ 

char str[] = "My name is Ioanna Kelley!"; 

int i; 

stack s; 


initializers); 

printf(" In the string: %s\n", str); 
for (i = 0; str[i] != ’\0’; ++i) 
if (!full(&s)) 

push(str[i], &s); /* 

printf("From the stack: "); 
while (!empty(&s)) 

putchar(pop(&s)); /* 

putchar('\n'); 
return 0; 


/* initialize the stack */ 


/* push a char on the stack */ 


/* pop a char off the stack */ 


Observe that our function mai n () is very similar to what we wrote in Section 9.10, “The 
ADT Stack,” on page 434. Although here we have implemented the ADT as a linked list 
and in Section 9.10, “The ADT Stack,” on page 431, we implemented the ADT as a 
string, the use of the operations is similar. Here is the output of our program: 

In the string: My name is Joanna Kelley! 

From the stack: lyelleK annaoJ si eman yM 


iiliiilr^ 
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10.6 An Example: Polish Notation and Stack Evaluation 

Ordinary notation for writing expressions is called infix , where operators separate argu¬ 
ments. Another notation for expressions, one that is very useful for stack-oriented eval¬ 
uation, is called Polish, or parenthesis-free, notation. In Polish notation, the operator 
comes after the arguments. Thus, for example, 

3,7,+ is equivalent to the infix notation 3 + 7 

In Polish notation, going from left to right, the operator is executed as soon as it is 
encountered. Thus 

17, 5, 2, *, + is equivalent to 17 + (5 * 2) 

A Polish expression can be evaluated by an algorithm using two stacks. The Polish 
stack contains the Polish expression, and the evaluation stack stores the intermediate 
values during execution. Here is a two-stack algorithm that evaluates Polish expressions 
where all operators are binary: 

A two-stack algo rithm to eva luate Pol ish expressions 

1 If the Polish stack is empty, halt with the top of the evaluation stack as the 
answer. 

2 If the Polish stack is not empty, pop the Polish stack into d. (We will use d, dl, 
and d2 to hold data.) 

3 If d is a value, push d onto the evaluation stack. 

4 If d is an operator, pop the evaluation stack twice, first into d2 and then into dl. 
Compute dl and d2 operated on by d, and push the result onto the evaluation 
stack. Go to step 1. 

We illustrate this algorithm in the following diagram, where the expression 
13 s 4, -, 2, 3, \ + 


is evaluated. 
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4b b 
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A two-stack algorithm to evaluate Polish expressions 


E 

v 

a 
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P u 4 
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It 2 
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so * 
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3 

* 
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t ^_ t jninlements this two-stack algorithm. A key idea will be 
Let us write a program that ra.pl torm „f an integer or an open 

" form S o°f ^“ program win consist of a .* fiie and five * Mes. 

- n.l 


ere is our header file: 


In file polish.h 

/it A linked list implementation of a Polish stack. V 


#include <assert.h> 
#include <ctype.h> 
#include <stdio.h> 
#include <stdlib.h> 


#define EMPTY 
#define FULL 


0 

10000 


struct data { 

enum {operator, valuer 

union { 

char op; 
int val; 

} 

}; 


ki nd; 



typedef 

typedef 


struct data 
enum {false, true} 


data; 
boolean; 


„ , /* an element on 

struct elem { 

data d; 

struct elem ’-'next; 


the stack */ 


typedef struct elem elem; 
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struct stack { 
int cnt; 
elem *top; 


/* a count of the elements */ 
/* Ptr to the top element */ 


typedef struct stack stack; 
boolean empty(const stack *stk)■ 

void evaluate(stack ^polish); 

hnnl oa n fTljCstack *stk, const char *str); 

boolean full(const stack *stk); 

void initialize(stack *stk)■ 

aata pop(stack *stk); 

void prn_data(data *dp); 

void prn_stack(stack *stk); 

push(data d, stack *stk); 
aata top(stack *stk); 


f ‘ le , ‘ S Similar t0 the one ™ 1" our stack program in 

be a s,r„c,u re 1, *. mato «*"■• * that « have deft,led data 

yp . The structure contains a union that can hold either an i nt vainp 

an operator in the form of a char. It also contains a "flag” in the o™ rt Z 

t.on type. The flag will toll „ which Kind of data is being stored “ "** 


In file main.c 


/ Test the two stack Polish evaluation algorithm. */ 
^include "polish.h" 


int main(void) 

char str[] = " 13 , 4 2 3 * +"• 

stack polish; ’ ’ ’ ’ + ’ 


} 


printf("\ n %s%s\n\n", 

^ "Polish expression: 
fill(^polish, str); 
prn_stack(&polish)• 
printf("\n%s%d\n\n", 
"Polish evaluation: 
return 0; 


str) ; 

/* fill stack from string */ 
/* print the stack */ 

evaluate(&polish)); 


jhil^ls wmtog plone* 18 ™'T k tom v*”* and prlnt the «** » check that every- 
tho rt ik ? r0perly * rhe Unction that is of most interest is the one that evaluates 

the Polish stack. Let us present that function next. dt 
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In file eval.c 


/* Evaluation of the Polish stack. */ 

#include "polish.h" 

int evaluate(stack *polish) 

{ 

data d, dl, d2; 
stack eval; 

initialize(&eval); 
while (!empty(polish)) { 
d = pop(polish); 
switch (d.kind) { 
case value: 

push(d, &eval); 
break; 

case operator: 

d2 = pop(&eval); 
dl = pop(&eval); 

d.kind = value; /* begin overwriting d */ 

switch (d.u.op) { 
case '+': 

d.u.val = dl.u.val + d2,u.val; 
break; 
case '-': 

d.u.val = dl.u.val - d2.u.val; 
break; 
case '*': 

d.u.val = dl.u.val * d2.u.val; 

} 

push(d, &eval); 

} 

} 

d = pop(&eval); 
return d.u.val; 

} 

The evaluateO function embodies the two-stack algorithm that we presented earlier. 
We first pop d from the pol ish stack. If it is a value, we push it onto the eval stack. If 
it is an operator, we pop d2 and dl off the eval stack, perform the indicated operation, 
and push the results onto the eval stack. When the polish stack is empty, we pop d 
from the eval stack and return the i nt value d. u. val. 

We write the stack operations in the file stackx. Except for the inclusion of a different 
header file, there is no difference at all from the file stackx, which we wrote in Section 
10.S, “Stacks,” on page 461. 
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In file stack.c 

/’■ The basic stack routines. */ 

#include "polish.h" 

void initialize(stack *stk) 

stk -> cnt = 0; 
stk -> top = NULL; 

void push(data d, stack *stk) 
elem *p; 

p = malloc(sizeof(elem)); 
p -> d = d; 

p -> next = stk -> top; 
stk -> top = p; 

stk -> cnt++; 

} 

™u^fmTh 1 , e T' ati r ln Sectl0n 10 ' 5 ' " S,acks '" ° n page 461 ' the 'W> e d » a 

ir “ a ™ f , ha ( here d f ta IS 3 structure type. By using a typedef to embody the 
am nL, n “Tl' ADT stack implementation reusable. This is an impor- 

less work m H USm fv. COde that has already been wrltten and tes,cd ' the programmer has 
less work to do m the current project. 

We need to be able to fill a stack from a string that contains a Polish expression. Here 
is our function that does this: 

In file fill.c 

#include "polish.h" 

void fill(stack *stk, const char *str) 

const char *p = str- 
char cl, c2; 

boolean bl, b2; 

data d; 

stack tmp; 

initialize(stk); 
initialize(&tmp); 
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// First process the string and push data on tmp. 

V - 

while (*p != '\0') { 

while (isspace(*p) || *p == 

++p; 

bl = (boolean) ((cl = *p) == ’+’ || cl == || cl == 

b2 = (boolean) ((c2 = *(p + 1)) == || c2 == '\0'); 

if (bl && b2) { 

d.kind = operator; 
d.u.op = cl; 

} 

else { 

d.kind = value; 

assert(sscanf(p, "%d", &d.u.val) == 1); 

} 

if (IfulK&tmp)) 

push(d, &tmp); /* push data on tmp */ 

while (*p != ',' && *p != '\0') 

++p; 




} 

/ 


* 


// Now pop data from tmp and push on stk. 

V 

while (!empty(&tmp)) { 

d = pop(&tmp); /* pop data from tmp */ 

if (!full(stk)) 

push(d, stk); /* push data on stk */ 

} 


First we process the string to extract the data. As we find the data, we push it onto the 
stack tmp. After we finish processing the string, we pop the data off of tmp and push it 
onto the stack pointed to by stk. We do this so that the element in the stack pointed to 
by stk will be in the right order. 

We write two printing functions so that we are able to check that our code is working 
properly. Here are the functions: 
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In file print.c 


#include "polish.h" 

void prn_data(data *dp) 

{ 

switch (dp -> kind) { 
case operator: 

printf("%s%3c\n", 
"kind: operator 
break; 
case value: 

printf("%s%3d\n", 
"kind: value 

} 


op:", dp -> u.op); 


val:", dp -> u.val) 


void prn_stack(stack *stk) 
data d; 

printf("stack count:%3d%s" 
stk -> cnt, (stk -> cnt 
if (!empty(stk)) { 

d = pop(stk); /■ 

prn_data(&d); /• 

prn_stack(stl<); /■ 

push(d, stk): /> 


== 0) ? "\n" : " 

pop the data */ 
print the data */ 
recursive call */ 
push the data */ 


he algorithm to print the stack is quite simple. First, pop d off the stack and print i 

en ma e a recursive call to prn_stack(). Finally, push d back onto the stack. Th 

e ec o this is to print all the data in the stack and to leave the stack in its origint 
state. Here is the output from our program: 


stack 

stack 

stack 

stack 

stack 

stack 

stack 

stack 


count: 
count: 
count: 
count: 
count: 
count: 
count: 
count: 


13, 4, 

“. 2 3 * 

+ 


ki nd: 

value 

val: 

13 

ki nd : 

value 

val: 

4 

kind: 

operator 

op: 


ki nd: 

val ue 

val: 

2 

kind: 

value 

val: 

3 

kind: 

operator 

op: 

* 

ki nd: 

operator 

op: 

+ 


Polish evaluation: 15 


10.7 Queues 


A queue is another abstract data type (ADT) that we can implement as a linear linked 
list. A queue has a front and a rear. Insertion occurs at the rear of the list, and deletion 
occurs at the front of the list. 

A queue implementation 


queue 



Our implementation of the ADT queue will include a number of the standard queue 
functions. Here is our header file: 

In file queue.h 

/* A linked list implementation of a queue. */ 

#include <assert.h> 

#include <stdio.h> 

#include <stdlib.h> 

#define EMPTY 0 

#define FULL 10000 

typedef unsigned int data; 

typedef enum {false, true} boolean; 

struct elem { 
data 

struct elem 

}; 


d; 

*next; 


/* an element in the queue */ 
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-an 


typedef struct elem 
struct queue { 
int cnt; 
elem *front; 
elem *rear; 

}; 


elem 




/* a count of the elements */ 
/* ptr to the front element */ 
/* ptr to the rear element */ 


typedef 

void 

void 

data 

data 

boolean 

boolean 


struct queue queue; 
initialize(queue *q); 
enqueue(data d, queue *q) 
dequeue(queue *q); 
front(const queue *q); 
empty(const queue *q); 
full(const queue *q); 


In file queue.c 

/* The basic queue routines. */ 
#include "queue.h" 

void initialize(queue *q) 

q -> cnt = 0; 
q -> front = NULL; 
q -> rear = NULL; 

} 
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At the bottom of the header file, we put the list of function prototypes. We write the 

function definitions in the file queue.c. These functions, together with this header file t 
implement the ADT queue. 
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In file queue.c 


data dequeue(queue *q) 

{ 

data d; 
elem *p; 

d = q -> front -> d; 
p = q -> front; 
q -> front = q -> front 
q -> cnt—; 
free(p); 
return d; 


-> next 


■H 


IIP' 


■ 


wmm 


} 
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void enqueue(data d, queue *q) ' 

{ 

elem *p; 

p = malloc(sizeof(elem)); 
p -> d = d; 
p -> next = NULL; 
if (!empty(q)) { 

q -> rear -> next = p; 
q -> rear = p; 

} 

el se 

q -> front = q -> rear = p; 
q -> cnt++; 


data front(const queue *q) 

{ 

return (q -> front -> d); 

} 


boolean empty(const queue *q) 

{ 

return ((boolean) (q -> cnt == EMPTY)); 

} 


boolean full(const queue *q) 

{ 

return ((boolean) (q -> cnt == FULL)); 


The enqueue() routine uses the storage allocator mallocO to create a new queue 
element, and the dequeue () routine returns the freed up storage back to the system. 

A queue is a first-in-first-out (FIFO) data structure. The first item to be placed onto the 
queue is the first to be removed. Queues are very useful for a variety of system pro¬ 
gramming applications. For example, they are often used to schedule resources in an 
operating system. They are also useful in writing event simulators. 

As an example, let us write an elementary resource scheduler for a two-processor 
system. We will assume that a process can request to be serviced by either processor A 
or processor B. The process will have a unique process identification (pid) number. 
After reading all the requests, the scheduler will print the schedule of processes served 
by each processor. 
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In file main.c 

/ Using queues to schedule two resources. */ 
#include "queue.h" 

int main(void) 


int 

int 

int 

data 

queue 


c; 

cnt_a 
cnt_b 
pid; 
a, b; 


/* process id number * 


initialized); 

- initialized); 

/“Enqueue the requests */ 
while ((c = getcharO) i= EOF) { 
switch (c) { ; 1 

case 'A': 

assert(scanf("%u", & p i d ) == n■ 

if (!ful1(&a)) V } 

enqueue(pid, & a ); 

break; 
case ? b 1 * 

if s afuiic«b)) %u ' ■ spid) ■■ 

enqueue(pid, &b); 


pri quests and print them.*/ 

mL-i ^ r >,' nA s schedule.-\n") ; 
while (!empty(&a)) { 

pid = dequeued); 

} print fC 30B %u is %d\n", ++cnt_a, pid); 

Printf("\nB's scheduled")- 
while (!empty(&b)) { ’ 

pid = dequeue(&b); 

j pr i nt f( JOB %u is %d\n", ++cnt_b, pid); 
return 0; 
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There is one way to march down a linear list, namely from head to tail. However, there 
are several natural ways to visit the elements of a binary tree. The three commonest are 
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These standard methods of visitation are the basis for recursive algorithms that 
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typedef struct node NODE- 

typedef NODE *BTREE * 

#include lr fct_proto.h° 
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/* Inorder binary tree traversal. */ 


» | * 1 / n •»!*" r*k r- fr—* « \ 

void i norder(BFREE root) 

{ 

if (root != NULL) { 

inorder(root -> left); 
printf("%c ", root -> d); 
inorder(root -> right); 

} 

} 


/* recur left */ 
/* recur right */ 


The function i norderO will print the values of each node in the binary' tree pointed 
at by root. The pictured binary tree would be traversed by i norderO, printing 

ABCDEFGHIJ 

A binary tree 
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The corresponding preorder and postorder functions are as follows: 

eorder and postorder binary tree traversal, */ 
void preorder(BTREE root) 

if (root != NULL) { 

printf("%c ", root -> d)• 
preorderfroot -> left)- ’ 
preorderfroot -> right); 

} 

void postorder(BTREE root) 

if (root != NULL) { 

postorderfroot -> left)* 
postorderfroot -> right)- 
} Priotf("% c ", root -> d); 

} 

Preorder visitation of the binary tree just shown would print 
G ° 8 A C F E I H 3 

Postorder visitation would print 


^ C B E F D 


H J I c 


“n LTt K £S “rittas" CareMly VerIlV by hand. 

Creating Trees 


We will create a binary tree fmm n, , 

use the dynamic storage allocator mall oca' 38 “ array; ' As with ^ts, 
/* Creating a binary tree. */ 

BTREE new_node(void) 
i 

j. return (malloc(sizeof(NODE))) ; 


we will 
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BTREE init_node(DATA dl, BTREE pi, BTREE p2) 

{ 

BTREE t; 

t = new_node(); 
t -> d = dl; 
t -> left - pi; 
t -> right = p2; 
return t; 


We will use these routines as primitives to create a binary tree from data values 
stored in an array. There is a very nice mapping from the indices of a linear array into 
nodes of a binary tree. We do this by taking the value a[i ] and letting it have as child 
values a[2*i+l] and a[2*i+2]. Then we map a[0] into the unique root node of the 
resulting binary tree. Its left child will be a [1], and its right child will be a[2]. The 
function create_tree() embodies this mapping. The formal parameter size is the 
number of nodes in the binary tree. 

/* Create a linked binary tree from an array. */ 

BTREE create_tree(DATA a[], int i, int size) 

{ 

if (i >= size) 
return NULL; 
el se 

return (init_node(a[i], 

create_tree(a, 2 * i + 1, size), 
create_tree(a, 2 * i +2, size))); 

} 


10.9 General Linked Lists 

For some data structures, we wish to combine the use of arrays with the use of lists. 
The arrays provide random accessing, and the lists provide sequential accessing. We 
will show one such example in this section, an implementation of a general tree. In a 
general tree, a node can have an arbitrary number of children. It would be very wasteful 
to specify a structure using the maximum number of links for each node. 

We will represent a general tree as a number of linear linked lists, one for each node 
in the tree. Each list will be the children of a single node. The lists will have an array 
that will point at the first child of the corresponding node. The base element of the 
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476: * general tree at the beginning of Section io.8, “Binary Trees," on pa. 
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A general tree and associated list structure 


NULL 


NULL 


NULL 


NULL 


NULL 


Such trees can be represented using the following header file: 


In file gtree.h 

^include <assert.h> 
#include <stdio.h> 
#include <stdlib.h> 

typedef char DATA 


struct node { 
int 
DATA 

struct node 


chi1d_no; 
d; 

*sib: 


typedef struct node NODE- 

typedef NODE *GTREE; 

#include "fct_proto.h" 


/* function prototypes */ 
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We will use an array of type GTREE, say t, where t[0] points to the root element rep- 
jH resented as type NODE. Siblings will be reached by linear list chaining, and children by 
|§1 array indexing. Let us examine the creation of a particular such tree. We first write rou- 
|J§ tines to create a single node. 










1H1 


8 k'. 


/•• Create a new node. */ 

GTREE new_gnode(void) 

{ 

return (malloc(sizeof(NODE))); 

> 

GTREE init_gnode(DATA dl, int num, GTREE sibs) 

{ 

GTREE tmp; 

tmp = new_gnode(); 
tmp -> d = dl; 
trap -> child„no = nutn; 
tmp -> sib = sibs; 
return tmp; 

} 

Let us use these routines to create the tree of the previous diagram. Because it con¬ 
tains eight nodes, we need an array t[] of size 9 and type NODE, where t[0] is the 
pointer to the root node. 


t [0] 
t[l] 
t[l] 
t[l] 
t[2] 
t [2] 
t [2] 
t[3] 
t [4] 

*. rci 

t Lb J 
t [6] 
t [7] 
t[8] 


- irsit. 
= lmt 

> sib 
•> sib 
= init. 
■> si b 

> sib 
= NULL 
= NULL 
= NULL 
= init. 
= NULL 
. NULL 


.gnodeC'a', 1, NULL); 

.gnodeC'b' , 2, NULL); 

= init_gnode('f', 6, NULL); 

-> sib = init_gnode(’h', 8, NULL); 
,gnode( ’ c' , 3, NULL) ; 

= init_gnode('d', 4, NULL); 

-> sib = init_gnode('e', 5, NULL); 


.gnodeCg', 7, NULL) 


It is easy to detect certain properties in this representation, such as whether a node is a 
leaf node or how many children a node has. For node n, if t[n] points at NULL, then the 
node is a leaf. 
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Traversal 

aversal becomes a combination of ( 1 ) moving alone lists and o\ mH 
elements that point at the lists 8 f (2) mdexm S into 

order, and inorder to these stmrr g the trdversaI ordering of preorder, 

rithms are protoType^ ^ V* ^ 

beCaUSe the) ' *““»<«« tt»t =ach demen, wii, be reached m iLa“ me * 

/ Preorder traversal of general trees. */ 
void preorder_g(GTREE t, int ind) 

CTREE tmp; /* tmp traverses the sibling list */ 

while (tmp d ]= NULL) { t[ind] is the r °ot node */ 

printf("% c %d\n", tmp -> d, tmp -> child nnV 
preorder.gCt, tmp -> child_no)T chlld - no )- 
tmp = tmp -> sib; 

} 

} 
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uid'lbr"' lum ,"": 1 Cal 1 oc ° P rov ' de s contiguous allocation of stornm tbit 
used Ioi arrays, its function prototype is given ?„ s,Mb.h as 6 ““ ** 

void *calloc(si 2 e_t n, size_t size): 

“ ”o ^ »K. the vaiue returned 

of the form Typically, si ze_t is equivalent to unsigned.A function call 

callocCn, size) 
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Heeded rather than specifying during compilation an array size that is very large so as 
to accommodate all cases that might be of interest. 

For example, we will want a routine that can build a general tree from a list of edges 
Sfiid an array of type DATA. If we wish to have an array of size 10 to store the tree head¬ 
sets, we can write 


m ■ 


§ t = calloc(10, sizeof(CTREE)); 

jj Now we can pass the dynamically allocated array t of type pointer to CTREE to a func¬ 
tion bui 1 dtree() in order to construct a general tree. This function will take an edge 
§f list representation of a tree and compute its general list structure representation. 

/* Function buildtree creates a tree from an array of edges. */ 




Sftei 


typedef struct { 
int 
i nt 
} PAIR; 


/* PAIR represents an edge in a tree */ 


out; 
i n; 


mm 
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void buildtree(PAIR edges[], DATA d[], int n, CTREE t[]) 
m { 

int i; 

int x, y; /* points of edge */ 

t[0] = init__gnode(d[1], 1, NULL); 

/* t[0] takes node 1 as root */ 

for (i =1; i <= n; ++i) 
t[i] = NULL; 

for (i =0; i < n - 1; ++i) { 
x = edges[i].out; 
y = edges[i]-in; 

t[x] = init_gnode(d[y], y, t[x]); 

} 

} 

Similar data structures and functions can be used to develop representations of gen¬ 
eral graphs, sparse matrices, and complicated networks. 
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Summary 


1 Self-referential structures use pointers to address identically specified elements. 


2 The simplest self-referential structure is the linear linked list. Each element 
to its next element, with the last element pointing at NULL, defined as 0. 


3 The function malloc() is used to dynamically allocate storage. It takes an 
ment of type size_t and returns a pointer to void that is the address of the 
cated storage. 


4 The function f ree() is a storage management routine that returns to av 
storage the block of memory pointed at by its argument. 


5 Standard algorithms for list processing are naturally implemented recursively, 
quently, the base case is the detection of the NULL link. The general case recurs 
moving one link over in the list structure. 


6 When algorithms are implemented iteratively, one uses an iterative loop, t 

ing when NULL is detected. Iterative algorithms trade the use of auxiliary pointers 
for recursion. 


7 The abstract data type (ADT) stack is iinplementable as a linked list, with access 
restricted to its first element, which is called the top. The stack has a LIFO (last- 
first-out) discipline implemented by the routines pushQ and pop(). 


8 The ADT queue is also implementable as a linked list, with access restricted to its 
front and rear ends. The queue has a FIFO (first-in-first-out) discipline implemented 
by the routines enqueue () and dequeue (). 


9 Binary trees are represented as structures with two link members and combine the 
dynamic qualities of linear lists with, on average, significantly shorter access paths 
to each element. The distances to elements of binary trees are usually logarithmic, 


10 Binary trees are traversed most often in one of three major patterns: preorder, inor¬ 
der, or postorder. Each ordering is determined by when the root is visited. Preorder 
visits the root first; inorder, after the left subtree; and postorder, last. These tra¬ 
versal patterns are readily implemented as recursions, chaining down both left and 
right subtrees. 
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11 Data structures of formidable complexity, involving both lists and arrays, can be 
specified. One example of their use is in the implementation of a general tree, 
where a node has an arbitrary number of children. The children of a node are repre¬ 
sented as a list pointed to by an array of header elements. 


Exercises 


1 Try the following code. Why does your compiler complain? Fix the code so it com¬ 
piles correctly: 

struct husband { 

i nt age; 

char name[10]; 

struct wife spouse; 

} a; 


struct wife { 
i nt 
char 

struct husband 
} b; 


age; 

name[10]; 
spouse; 


2 Change the type definition of DATA in the file list.h to 

typedef struct { 
char name[10]; 
int age; 

int weight; 

> DATA; 

and write a function create_1 () that transiorms an array of such data into a linear 
list. Write another routine, one that will count the number of people above a given 
weight and age. 

3 Given a linear list of the type found in the previous exercise, write a routine 
sort_age() that will sort the list according to its age values. Write a function 
sort_name() that will sort in lexicographic order based on the name values. 


4 Combine the sorting functions in the previous exercise so that they share the most 
code. This is best done by defining a routine called compare() that returns either 0 
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15 An element of a doubly linked line nr list cnn be defined ns 

typedef struct dllist { 

DATA d; 

struct dllist *prev; 

struct dllist *next; 

} ELEMENT; 

This adds an extra member but allows easier traversal along the list. 


A doubly linked list 


head 



Write iterative routines to perform insertion and deletion. 

16 Write a routine del_dupl () that deletes duplicate valued elements in a doubly 
linked list. 


17 Evaluate the following Polish expressions by hand: 


7, 6, 3, * 

9 , 2 , 3 , *, 4 , -, + 

1, 2, +, 3, 4, +, ” 

18 Write corresponding Polish expressions for the following: 


(7 + 8 + 9) * 4 

(6 - 2) * (5 + 15 * 2) 

6-2 *5 + 15 *2 

19 Use the polish program that we wrote in Section 10.6, “An Example: Polish Notation 
and Stack Evaluation,” on page 466, to evaluate the six Polish expressions given or 
derived from the two above exercises. 
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20 The following lines occur in the eval uateO function in.the polish program: 

i 

case operator: | 

d2 = pop(&eval); j 

dl = pop(&eval); 

What happens if we write 

case operator: 

dl = popC&eval); 
d2 = popC&eval); 

instead? The program should work properly on some Polish expressions but not on 
others. Explain. 

21 Create another version of the polish program. First, write a routine that reverses the 
order of the elements in a stack. Next, append this routine to the file stack.c, and 
add the function prototype to the header file. Then rewrite the function f i 11 () so 
that it makes use of your routine. Finally, test your program. 

22 Write a routine that allows you to interactively initialize the contents of a Polish 
stack. Write a program to test your routine. 

23 When we tested our polish program, the Polish expression w r e used did not have any 
unary operators in it. Try the program with the following expression: 
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Is the polish program able to handle this correctly? (It should be able to.) Write 
another version of the polish program that can handle both unary plus and unary 
minus operators in the list. For example, your program should be able to handle the 
following Polish expression: 


In this expression, some of the plus and minus signs are unary and some are binary. 
24 Write a function whose prototype is 

queue data_to_queue(data d[], int n); 

where n is the size of the array d. When passed an array of type data, this function 
should use the elements in the array to build a queue that gets returned. 
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25 Write a function whose prototype is 

data *queue_to„data(queue q), 

_ , . nut of the values found in the queue. The size of the array 

that builds a data array out ot tne values 

should he q * ent, 

i v^orinipr nrnoram that we wrote in Section 10,7, Queues, on 

26 " ""•- “ toput: 

X X x A 2323 B 1188 yyy zzz c 3397 3398 A 4545 X surpnS6? 

Are you surprised that it works? Explain. 

27 Write a four-resource version of the scheduler program. 

, TATpntp for the orevious exercise to allow for priorities. 

28 Modify the program that you wrote P roccssor ID number and a pri- 

Change the type data to a stnaetuxe ^hat haa aoth tt p „ md 9 Create sched . 

u es where the highest priority processes are serviced first. Cannon. Do 
confuse the concepts of -process” and -processor. 

29 Write routines for binary trees that 

1 count the number of nodes 

2 count the number of nodes having a particular value, say V 

3 print out the value of only leaf nodes 

e ,, , jrr . vw n f -j nt s such that a left child has value less than 

30 SToS, ”d a rSh« ehlldhas value greater than or equal to the value of Its 
mot Such a tree is displayed in the following diagram: 

A binary tree with ordered values 

8 


4 15 



2 6 8 17 
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Insert a new element by comparing its value to the root and recurring down rhJ 
proper subtree until NULL is reached. 8 aown ,tle 

51 exercise, write a function that uses inorder traversal to 

ptace tiic values of the nodes m sorted order ' 0 
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m an array key []. 



Deleting the root node with replacement 


Before: 


After: 


tree. A heap^a binarv^ee ^ 1 2 3 4 ' S °! tmg routlne based on an ordered binary 

dren This nrnnprtv e in w ch a node has value smaller than any of its chil- 

wish to tZ T ^ ara '! teeS that the root is the smalles t dement in the tree. We 
Ta entries ^ ° f ^ and create a heap with these values as 


1 Delete the root, placing it in an output array as the smallest element. 

2 Take the rightmost leaf node and make it the new root. 

3 twovalues^ 6 ValU6 t0 b ° th children and change with the smaller of the 

4 cMdien" t0 6XChange V3lUeS Until the CUrrmt n0de is smallcr jtl value *an its 

auavh-l!^! 011 ^ I 83 ™ 3 heaP ' Repeat lhese steps unlil the tree is empty and the 
array has the ongmal tree values in sorted order. You must figure out how to get 
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32 Write a program that deletes the root node of a binary tree and replaces the mm 
with the rightmost leaf node. replaces the root 
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the tree into heap order to begin with; see Aho, Hopcroft, and Ullman, Data Struc¬ 
tures and Algorithms (Reading, MA: Addison-Wesley, 1987). 

Print out the nodes of a binary tree in level order. The root node is at level 0. On the 
next level of nodes are the children of the previous level. First, print the root node. 
Then print the left child and the right child that are at level 1. Continue printing the 
nodes from left to right, level by level. This is also called breadth-first traversal. 

Write a routine that computes the maximum level leaf node of a general tree. 

Write a function that converts a binary tree into a general tree representation. 

Add a field called weight to our general tree structure. Write functions to 

1 compute the sum of all node weights 

2 compute the maximum weighted path, where the weighted path of node i is the 
weights of the nodes from the root to node i 

It can be proved that the maximum weighted path occurs at a leaf, given that all 
weights are nonnegative. 

Use a general linked list structure to program sparse matrix addition. A sparse 
matrix is one in which most of its values are zero. A nonzero element of a sparse 
matrix will be represented by the triple (i, j, value). For each row /, the triples will be 
linked as a linear list headed by an array of pointers row[i ]. Similarly, for each col¬ 
umn j, the triples will be linked as a linear list headed by an array of pointers 
col [j]. To add matrix A to matrix B, we take each row and merge them into a 
matrix C. If both rows have a triple with the same column value, then in the output 
row Cjj is the sum ay + by. Otherwise, the element with smallest column number 
becomes the next element of the output row. 

(Advanced) The representation in the previous exercise can also be used to do 
sparse matrix multiplication. Although addition can be performed with just row- 
linked lists, both row- and column-linked lists are used for multiplication. Write a 
program to do sparse matrix multiplication. 

(Project) Implement the sparse polynomial ADT. Use a linked list of elements to 
represent the nonzero terms of the polynomial. A term is a real coefficient and a 
power. Write a complete polynomial manipulation package. Your package should be 
able to input and output polynomials, and it should be able to add, subtract, multi¬ 
ply, and copy polynomials. 
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In this chapter, we explain how to use some of the input/output functions in the stan¬ 
dard library, including the functions printfC) and scanf(). Although we have used 
these functions throughout this text, many details still need to be explained. We 
present extensive tables showing the effects of various formats. The standard library 
provides functions related to pri ntf 0 and scanf O that can be used for dealing with 
files and strings. The use of these functions is explained. 

General file input/output is important in applications where data reside in files on 
disks and tapes. We will show how to open files for processing and how to use a pointer 
to a file. Some applications need temporary files. Examples are given to illustrate their 
use. 

The operating system provides utilities for the user, and some of these utilities can 
be used by the programmer to develop C software. A number of the more important 
tools for programmers will be discussed, including the compiler, make, touch, grep, 
beautifiers, and debuggers. 
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11.1 The Output Function printf() 

The pri ntf O function has two nice properties that allow flexible use at a high level. 
First, a list of arguments of arbitrary length can be printed, and second, the printing is 
controlled by simple conversion specifications, or formats. The function pri ntf () 
delivers its character stream to the standard output file stdout, which is normally con¬ 
nected to the screen. The argument list to pri ntf O has two parts: 

controLstring and other ^arguments 
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In the example 

printf("she sells %d %s for f", 99, "sea shells", 3.77); 
we have 
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control_string: 
other_arguments: 


"she sells %d %s for f" 
99, "sea shells", 3.77 


m mm 
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The expressions in other-arguments are evaluated and converted according to the fry 
mats m the control string and then placed in the output stream. Characters in the con 
ro s ring t at are not part ol a format are placed directly in the output stream. The 1 
sym o introduces a conversion specification, or format. A single-conversion specific! 
tion is a string that begins with % and ends with a conversion character- ! 
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as a character 

as a decimal integer 

as an unsigned decimal integer 

as an unsigned octal integer 

as an unsigned hexadecimal integer 

as a floating-point number; example: 7.123000e+00 

as a floating-point number; example: 7.123000E+00 

as a floating-point number; example: 7.123000 

in the e-format or f-format, whichever is shorter 

in the E-format or f-format, whichever is shorter 
as a string 

the corresponding argument is a pointer to void; its value is 
printed as a hexadecimal number 

the corresponding argument is a pointer to an integer into 
which the number of characters written so far is printed; the 
argument is not converted 

with the format %% a single % is written to the output stream; 
there is no corresponding argument to be converted 
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The function pri ntf () returns as an i nt the number of characters printed. In the 
; example 

i?.. 

su 

: printf("she sells %d %s for f", 99, "sea shells", 3.77); 

we can match the formats in the control string with their corresponding arguments in 
the argument list. 

Format Corresponding argument 


"sea shells" 
3.77 


Explicit formatting information may be included in a conversion specification. If it is 
not included, then defaults are used. For example, the format %f with corresponding 
argument 3.77 will result in 3.770000 being printed. The number is printed with six 
digits to the right of the decimal point by default. 

Between the % that starts a conversion specification and the conversion character 
that ends it, there may appear in order 

■ zero or more flag characters that modify the meaning of the conversion specifica¬ 
tion. These flag characters are discussed below. 

■ an optional positive integer that specifies the minimum field width of the converted 
argument. The place where an argument is printed is called its field, and the num¬ 
ber of spaces used to print an argument is called its field width. If the converted 
argument has fewer characters than the specified field width, then it will be padded 
with spaces on the left or right, depending on whether the converted argument is 
right- or left-adjusted. If the converted argument has more characters than the 
specified field width, then the field width will be extended to whatever is required. 
If the integer defining the field width begins with a zero and the argument being 
printed is right-adjusted in its field, then zeros rather than spaces will be used for 
padding. 

■ an optional precision, which is specified by a period followed by a nonnegative inte¬ 
ger. For d, i, o, u, x, or X conversions, it specifics the minimum number of digits to 
be printed. For e, E, and f conversions, it specifies the number of digits to the right 
of the decimal point. For g and G conversions, it specifics the maximum number of 
significant digits. For an s conversion, it specifies the maximum number of charac¬ 
ters to be printed from a string. 
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an optional h or 1, which is a “short” or “long” modifier, respectively. If an h is (qS 
lowed by a d, i, o, u, x, or X conversion character, the conversion specificatirf 
applies to a short int or unsigned short i nt argument. If an h is followed by 
an n conversion character, the corresponding argument is a pointer to a short iisl 
or unsigned short int. If an 1 is followed by a d, i, o, u, x, or X conversion charactelP 

the conversion specification applies to a 1 ong int or unsigned long intargfj 
ment. If an 1 is followed by an n conversion character, the corresponding arguments 
is a pointer to a long int or unsigned long int. # 


. -s~. ■' . 
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i nt 

double 


m, n; 

x = 333.7777777: 


printfC’x = %*.*f\n", m, n, x); 


/* get m and n from somewhere */ 


an optional L, which is a "long” modifier. If an L is followed by an e, E, f, g, or G conJ 
version character, the conversion specification applies to a long double argument! 
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The flag characters are 
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a minus sign, which means that the converted argument is to be left-adjusted In itjf 

field. If there is no minus sign, then the converted argument is to be right-adjusted® 
in its field. m 
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a plus sign, which means that a nonnegative number that comes from a signed con¬ 
version is to have a + prepended. This works with the conversion characters d, i, e,J 
E, f, g, and G. All negative numbers start with a minus sign. M 
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m a space, which means that a nonnegative number that comes from a si gned conver-i 
sion is to have a space prepended. This works with the conversion characters d, i, lf 
e, E, f, g, and G. If both a space and a + flag are present, the space flag is ignored. ' ; 

a a#, which means that the result is to be converted to an “alternate form” that 1 
depends on the conversion character. With conversion character o, the # causes a U 
zero to be prepended to the octal number being printed. In an x or X conversion, % 
the # causes Ox or OX to be prepended to the hexadecimal number being printed. In i 
a g or G conversion, it causes trailing zeros to be printed. In an e, E, f, g, and G con- 1 

version, it causes a decimal point to be printed, even with precision 0. The behavior I 
is undefined for other conversions. 1 

E a zero, which means that zeros instead of spaces are used to pad the field. With d, 1 

1, o, u, x, X, e, E, f, g, and G conversion characters, this can result in numbers with I 

leading zeros. Any sign and any Ox or OX that gets printed with a number will pre¬ 
cede the leading zeros. 

In a format, the field width or precision or both may be specified by a * instead of an 1 

integer, which indicates that a value is to be obtained from the argument list. Here is an j 
example of how the facility can be used: 
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If the argument corresponding to the field width has a negative value, then it is taken as 
a - flag followed by a positive field width. If the argument corresponding to the preci¬ 
sion has a negative value, then it is taken as if it were missing. 

The conversion specification %% can be used to print a single percent symbol in the 
output stream. It is a special case because there is no corresponding argument to be 
converted. For all the other formats, there should be a corresponding argument. If there 
are not enough arguments, the behavior is undefined. If there are too many arguments, 
H the extra ones are evaluated but otherwise ignored. 

The field width is the number of spaces used to print the argument. The default is 
whatever is required to properly display the value of the argument. Thus, the integer 
value 255 (decimal) requires three spaces for decimal conversion d or octal conversion 
o, but only two spaces for hexadecimal conversion x. 

When an argument is printed, characters appropriate to the conversion specification 
are placed in a field. The characters appear right-adjusted unless a minus sign is 
present as a flag. If the specified field width is too short to properly display the value of 
the corresponding argument, the field width will be increased to the default. If the 
entire field is not needed to display the converted argument, then the remaining part of 
the field is padded with blanks on the left or right, depending on whether the converted 
argument is right- or left-adjusted. The padding character on the left can be made a 
zero by specifying the field width with a leading zero. 

The precision is specified by a nonnegative number that occurs to the right of the 
period. For string conversions, this is the maximum number of characters to be printed 
from the string. For e, E, and f conversions, it specifies the number of digits to appear 
to the right of the decimal point. 

Examples of character and string formats are given in the next table. We use double¬ 
quote characters to visually delimit the field. They do not get printed. 
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Declarations and initializations 


char 


A\ s[] = "Blue moon!" 




Corresponding How it is printed in 
Format argument its field Remarks 


%“3c 


%3s 

%76s 

%-ll. 8 s 


"A" 

n 

"‘a"~ ~ ~~ 

”A~~ " 

"Blue moon!” 
"Blue" moon! ” 
Tf BTue'm M 
”Blue moo ” 


field width 1 by default 
field width 2, right adjusted 
field width 3, left adjusted 
field width 10 by default 
more space needed 
precision 6 

precision 8, left adjusted 


rd.. - 

wBimm 


Examples of formats used to print numbers are given in the next table. Again, we use 
double-quote characters to visually delimit the field. They do not get printed. 


Declarations and initializations 

1 

int i 

= 123; 


i 

double x = 

0.123456789; 


i 

I 

1 

Corresponding 

How it is printed in its 


Format 

! 

argument 

field i 

Remarks 



%10.5f 
%~12.5e 


1 1 -t '■v ii 

123 


”00123” 


” 173” 


”7b 


”0x7b 


” 0.12346” 


field width 3 by default 


padded with zeros 


right adjusted, octal 


left adjusted, hexadecimal 


left adjusted, hexadecimal 


field width 10, precision 5 
d 
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pi 2 The Input Function scanfQ 


WiM, „ i( .p nmnprties that allow flexible use at a high level. The 

^^SSYllsTlflrgurncnts of arbitrary length can be scanned, and the second is 
tefirst is that a h 8 ■ lp conversion specifications, or formats. 1 he func 

1 that the input is controll } standard input file stdi n. The argument list to 

gjjilon scanf O reads characters from the stanaaru my 

BtecanfO has tw0 P arts: 


St controLstring and other.arguments 

m 

, 

In the example 


char 

int 

double 


b, C, s[100]; 


scanf("%c%c%c%d%s%lf", &a, &b, &c, &n, s, &x) 


>» ■at •• 

If we have 


controLstring: 
other .arguments: 


"%c%c%c%d%s%l 

&a, &b, &c, &n, s, &x 


Ssgfjp: x/ V » *• w . — ^ s,/ 

Sl? r fmi a wino the control string consist of a comma-separated list of 

pi The other arguments to Mowing* e c0 , hat * , he pre eeding example, writing 4s 
m nointer expressions, or addresses. Note mat l 
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scanf() conversion characters 


Conversion 

character 


d, i 


u 


o 


x 

e 


X 


E, f, g, G 


P 


n, %, [ 


Characters in the input stream 
that are matched 


any character, including white space 


an optionally signed decimal integer 


an optionally signed decimal integer 


an optionally signed octal integer 


an optionally signed hexadecimal integer 


an optionally signed floating-point number 


a sequence of nonwhite space characters 


what is produced by %p in pri ntf (), usu¬ 
ally an unsigned hexadecimal integer 


see the next table 


Corresponding 
argument is a pointerto 


char 


integer 


unsigned integer 
unsigned integer 


unsigned integer 


a floating type 


char 


void 


* 


Three conversion characters are of a special nature, and one of these, [...], is not evil 
a character, although the construct is treated as such, 


scanf () conversion characters HI 

Conversion 

character 

3§fijf§ S 

Remarks Mt 

: 

n 

No characters in the input stream are matched. The corresponding argument 
is a pointer to an integer, into which gets stored the number of characters readII 
so far. jjf 

% 

A single % character in the input stream is matched. There is no corresponding! 
argument. '!jj 

E ■ * • 1 

The set of characters inside the brackets [ J is called the scan set. It determines ; 
what gets matched. (See the following explanation.) The corresponding argu- Jt 
ment is a pointer to the base of an array of characters that is large enough to |f 
hold the characters that are matched, including a terminating null character 
that is appended automatically. jjj 

. . . . . —irrt ''*? 
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e control string may contain 

white space, which matches optional white space in the input stream. 

ordinary nonwhite space characters, other than %, Each ordinary character must 
match the next character in the input stream, 

conversion specifications that begin with a % and end with a conversion character. 
Between the % and the conversion character, there may be an optional * that indi¬ 
cates assignment suppression, followed by an optional integer that defines a maxi- 
mum scan width, followed by an optional h, 1 , or L that modifies the specification 
character. 

* the modifier h, which can precede a d, i, o, u, x, X conversion character. It indicates 
that the converted value is to be stored in a short i nt or in an unsigned short 
i nt. 

the modifier 1, which can precede either ad, i, o, u, x, X conversion character or an 
e, E, f, g, G conversion character. In the first case, it indicates that the converted 
value is to be stored in a long i nt or in an unsigned long i nt. In the second 
case, it indicates that the converted value is to be stored in a double. 

the modifier L, which can precede an e, E, f, g, G conversion character. It indicates 
that the converted value is to be stored in a 1 ong doubl e. 

The characters in the input stream are converted to values according to the conver¬ 
ge sion specifications in the control string and placed at the address given by the corre¬ 
sponding pointer expression in the argument list. Except for character input, a scan 
field consists of contiguous nonwhite characters that are appropriate to the specified 
conversion. The scan field ends when a nonappropriate character is reached, or the 
scan width, if specified, is exhausted, whichever comes first. When a string is read in, it 
if is presumed that enough space has been allocated in memory to hold the string and an 
end-of-string sentinel \0, which will be appended. The format %ls can be used to read 
in the next nonwhite character. It should be stored in a character array of size at least 2. 
The format %nc can be used to read in the next n characters, including white space 
characters. When one or more characters are read in, white space is not skipped, As 
with strings, it is the programmer’s responsibility to allocate enough space to store 
these characters. In this case a null character is not appended. A format such as %1 f 
can be used to read in a doubl e. Floating numbers in the input stream are formatted as 
an optional sign followed by a digit string with an optional decimal point, followed by 
an optional exponent part. The exponential part consists of e or E, followed by an 
optional sign followed by a digit string. 
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11.3 ▼ The Functions fprintf(), fscanfC), sprintfC), and sscanf() 
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H 




A conversion specification of the form %[string] indicates that a special string is |1L 
be read in. If the first character in string is not a circumflex character a, then the str irinll 
is to be made up only of the characters in string. However, if the first character in string 
is a circumflex, then the string is to be made up of all characters other than those M 
string. Thus, the format %[abc] will input a string containing only the letters a, b, aiffl 

an st0 P ^ an y °lher character appears in the input stream, including a blank, 
The format %[A a bc] will input a string terminated by any of a, b, or c, but not by whlH 
space. The statement re 


the value 45 is placed in i, the comma is matched, the string "ignore_thi s" is 
jjgfjjgnored, the % is matched, the character C is placed in the variable c, the string " read_" 
(is placed in string[0] through string[5] with the terminating \0 in string[5], and 
finally the string "in_this**" is placed in string[5] through string[14], with 
mitri ng [14] containing \0. Because four conversions were successfully made, the value 
K 4 is returned by scanf()■ 


. 




- i 




scanf("%[AB \n\t]", s); 






will read into the character array s a string containing A’s, B’s, and the white space chaH 
acters blank, newline, and tab, #• 

These conversion specifications interact in a predictable way. The scan width is the 
number of characters scanned to retrieve the argument value. The default is whatevl^* 
is in the input stream. The specification %s skips white space and then reads in 
white space characters until a white space character is encountered or the end-of-file*ll$f^ 
mark is encountered, whichever comes first. In contrast to this, the specification %S'mHT 
skips white space and then reads in nonwhite characters, stopping when a white spadSf* 

character is encountered or an end-of-file mark is encountered or five characters havS 
been read in, whichever comes first. 

The function scanf () returns the number of successful conversions performed ThJffiKi 
value EOF is returned when the end-of-file mark is reached. Typically, this value is -O® 
The value 0 is returned when no successful conversions are performed, and this value Slltl 
is always different from EOF. An inappropriate character in the input stream can frus-f^» 
trate expected conversions, causing the value 0 to be returned. As long as the inpullSi 
stream can be matched to the control string, the input stream is scanned and values are H 
converted and assigned. The process stops if the input is inappropriate for the nexfB 
conversion specification. The value returned by scanf Q can be used to test that input# 
occuirecl as expected, or to test that the end of the file was reached. 

An example illustrating the use of scanf () is 


11.3 The Functions fprintfO, fscanfO, sprintfC) 

and sscanfQ 




Hr The functions fprintf () and fscanfO are file versions of the functions printfO 
jjjl and scanf (), respectively. Before we discuss their use, we need to know how C deals 
. with files. 

The identifier FILE is defined in stdio.h as a particular structure, with members that 
MR.---!describe the current state of a file. To use files, a programmer need not know any 
details concerning this structure. Also defined in stdio.h are the three file pointers 
snsstdin, stdout, and stderr. Even though they are pointers, we sometimes refer to 
them as files. 


Wk 




V, 


Bjj 


Written in C 

Name 

Remark 

stdin 

standard input file 

connected to the keyboard 

stdout 

standard output file 

connected to the screen 

stderr 

standard error file 

connected to the screen 


i nt i; 
char c; 
char string[15] 


sa m 


llisi; 


The function prototypes for file handling functions are given in stdio.h. Here are the 
prototypes for fprintfO and fscanfO: 


scanf( %d , %”s %% %c %Ss %s" , &i, &c, string, &string[5]) 
With the following characters in the input stream 

45 , ignore_this % C read_in_this** 


siti 


ISm 

mu 


int fprintf(FILE *fp, const char *format 
int fscanf(FILE *fp, const char ^format, 

A statement of the form 


) 


) 




fpri r\tf(file_ptr, controLstring, other^arguments) ; 

writes to the file pointed to by fi1e_ptr. The conventions for controLstring and 
other_arguments conform to those of printfO. In particular, 
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fprintf (stdout, is equivalent to printf (...); 

In a similar fashion, a statement of the form 

fscanf (/)7e_pfr , controLstring, other.arguments'); 
reads from the file pointed to by file_ptr. In particular, 

fscanf(stdin, ...); is equivalent to scanf (...); 
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11.4 


The Functions fopen() and fcloseO 
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In the next section, we will show how to use fopen () to open files and how toll 
fpri ntf () and fscanf () to access them. | 

The functions sprintfO and sscanfO are string versions of the function 
pri ntf () and scanf (), respectively. Their function prototypes, found in stdio.h, are 



sggti' 
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^ 








int 
i nt 


sprintf(char *s, const char ^format, ...) 
sscanf(const char *s, const char *format, 


B MBS-iT'' Aiwinawri'ia 

JBjlgSfli 


■ ■ J f 


The function sprintf () writes to its first argument, a pointer to char (string), instead 
of to the screen. Its remaining arguments conform to those for pri ntf (). The function 
sscanf() reads from its first argument instead of from the keyboard. Its remaini* 
arguments conform to those for scanf (). Consider the code 


m 

a?“S’iff-v. vV./ vV - 
asx 

va : 


WBMmm 




char 

int 


strl[] = 
a, b, c 


123 go", str2[100], tmp[100] 


sscanf(strl, "%d%d%d%s". &a, &b, Ac, tmp); 
sprintf(str2, "%s %s %d %d %d\n", tmp, tmp, a, b, c); 
pnntf("%s", str2); 



WmBmm! 

— 
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^Abstractly, a file can be thought of as a stream of characters. After a file has been 
Kggened, the stream can be accessed with file handling functions in the standard library. 
■■ In this section, we want to explain the use of fopen() and fcloseO. 
jjp Files have several important properties: They have a name. They must be opened and 
dosed. They can be written to, or read from, or appended to. Conceptually, until a file is 
■lopened nothing can be done to it. It is like a closed book. When it is opened, we can 
ji have access to it at its beg innin g or end. To prevent accidental misuse, we must tell the 
II; system which of the three activities—reading, writing, or appending—we will be per- 
forming on it. When we are finished using the file, we close it. Consider the following 
j| code: 

ffc; 

#inc1ude <stdio.h> 

jjls int main(void) 

He int a, sum = 0; 

FILE *ifp, *ofp; 


■ 

:-iK> 

rttcrs.yj, 

AP'.’ 

... 



ifp = fopen("my_fi1e", "r") 
ofp = fopen("outfile", "w") 


/* open for reading */ 
/* open for writing */ 






The function sscanfO takes its input from strl. It reads three decimal integers and „ 
string, putting them into a, b, c, and tmp, respectively. The function sprintfO writes 
to str2. More precisely, it writes characters in memory, beginning at the address stril 
Its output is two strings and three decimal integers. To see what is in str2, we invokjl 
pri ntf (). It prints the following on the screen: 


■ft-.-. - m 


w$SBg£m$z ‘ 
< L\\ >r?' 
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This opens two files in the current directory: myjtie for reading and outfile for writing. 
|) (The identifier i fp is mnemonic for “infile pointer,” and the identifier ofp is mnemonic 
;g for “outfile pointer.”) After a file has been opened, the file pointer is used exclusively in 
H all references to the file. Suppose that my_file contains integers. If we want to sum 
Iff them and put the result in outfile, we can write 


go go 1 2 3 


10%^.’":'’. ■- 


while (fscanf(ifp, "%d", &a) == 1) 
sum += a; 

fprintf(ofp, "The sum is %d.\n", sum); 

Note that fscanf (), like scanf (), returns the number of successful conversions. After 
we have finished using a file, we can write 

fclose(ifp) ; 

Tliis closes the file pointed to by i f p. 


, &a) == 1) 


• V, ■:?. 


Caution: It is the programmer’s responsibility to provide adequate space in memory! 
for the output of sprintfO. 

Reading from a string is unlike reading from a file in the following sense: If we use 

sscanfO to read from strl again, then the input starts at the beginning of the string,! 
not where we left off before. 
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A function call of the form f open (filename, mode ) opens the named file in a parting 
lar mode and returns a file pointer. There are a number of possibilities for the mode. 


Mode 




»i pfi 




eanmg 


open text file for reading 


open text file for writin 


open text file for append! 

*. w -A' 


open binary file for readi 


open binary file for writing 


open binary file for append! 


Each of these modes can end with a + character. This means that the file is to be open< 
for both reading and writing. 




11.5 ▼ An Example: Double Spacing a File 



1 
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1 11.5 An Example: Double Spacing a File 

Let us illustrate the use of some file handling functions by writing a program to double- 
space a file. In mainO, we open files for reading and writing that are passed as com¬ 
mand line arguments. After the files have been opened, we invoke doubl e_space() to 
! accomplish the task of double spacing. 


afs > 


■ In file dbLspace.c 




Mode 


M r+" 


n w+" 


Meaning 

open text file for reading and writing 
open text file for writing and reading 


A A * ~ 

; 

\ ; X ■*%) W'j 





m 

mm. z™. 



Opening for reading a file that cannot be read, or does not exist, will fail. In this case 
fopenO returns a NULL pointer. Opening a file for writing causes the file to be createcfjfn 
if it does not exist and causes it to be overwritten if it does. Opening a file in append A; 
mode causes the file to be created if it does not exist and causes writing to occur at the 
end of the file if it does. ; 

A file is opened for updating (both reading and writing) by using a + in the mode. 
However, between a read and a write or a write and a read there must be an intervening .■ 
call to ffl ush() to flush the buffer, or a call to one of the file positioning function cahQfj 
fseek(), fsetposO, or rewindO. if 

In some operating systems, including UNIX, there is no distinction between binarj|j|| 
and text files, except in their contents. The file mechanism is the same for both types df«f* 
files. In MS-DOS and other operating systems, there are different file mechanisms for ! 
each of the two types of files. (See exercise 22, on page 549, for further discussion.) J 
A detailed description of file handling functions such as fopen() and fcloseQ can 
be found in Section A.12, “Input/Output: <stdi o. h>,” on page 655. Consult the appen¬ 
dix as necessary to understand how the various functions are used. 
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#include <stdio.h> 

#incltide <stdlib.h> 

void double_space(FILE *, FILE *); 
void prn_info(char *); 


int inain(int argc, char **argv) 

{ 

FILE *ifp, *ofp; 

if (argc != 3) { 

prn_info(argv[0]); 
exit(l); 

} 

ifp = fopen(argv[l], "r"); 
ofp = fopen(argv[2], "w"); 
double_space(ifp, ofp); 
fclose(ifp); 
fclose(ofp); 
return 0; 


/* open for reading */ 
/* open for writing */ 


void double_space(FILE *ifp, FILE *ofp) 

{ 

int c; 

while C(c = getc(ifp)) != EOF) { 
putc(c, ofp); 
if (c == ’\n’) 

putc('\n', ofp); /* found a newline 

} 


dupli cate i *t ( / 


Jill 
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void prn_info(char *pgm_name) 

{ 

printf("\n%s%s%s\n\n%s%s\n\n", 

"Usage: ", pgm_name, " infile outfile", 

"The contents of infile will be double-spaced ", 
"and written to outfile."); 


Suppose we have compiled this program and put the executable code in the fiif* 
dblspace. When we give the command 

dbLspace file1 fiie2 J 

the program will read from file! and write to file2. The contents of file2 wall be the same 
as filed, except that every newline character will have been duplicated. I 










■ill 


i aNtfb 


print information about the program and exit() is invoked to exit the program. By 
% convention, exitQ returns a nonzero value when something has gone wrong. 


ifp = fopen(argv[l], 
ofp = fopen(argv[2], 


"r"); 
w"); 


h_.it 


/* open for reading */ 
/* open for writing */ 


mm. 








Ki8ilS£, 






iiSl : 
111 


§1$ We can think of argv as an array of strings. The function fopen() is used to open the 
Rift file named in argv[l] for reading. The pointer value returned by the function is 
1 assigned to ifp. In a similar fashion, the file named in argv[2] is opened for writing. 

JjT« double_space(ifp, ofp); 

fclfo v 

The two file pointers are passed as arguments to doubl e_space(), which then does the 
ft- work of double spacing. One can see that other functions of this form could be written 
ffe to perform whatever useful work on files was needed. 



Dissection of the dblspace Program 

■ finclude <stdio.h> 

#include <stdlib.h> 

void double_space(FILE *, FILE *); 

void prn_info(char *); 




‘i-- - 

. 

:: "-AT 


mm 




msmi 
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We have included stdlib.h because it contains the function prototype for exitQ, whichiP 
gets used in prn_info(). The identifier FILE is a structure defined in stdio.h. To make| 
use of files, we do not need to know system-implementation details of how the fileijp 
mechanism works. The function prototype for doubl e_space() shows that it takes twol{ 
file pointers as arguments. *g§|. 



« T 


Ki 

llltll 

ftli 1 
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- Jnt mainCint a rg c, char **ar 9 v) 

FILE *ifp, *ofp; 
if (argc != 3) { 

prn_info(argv[0]); ? 

exit(l); 

} 

the identifiers ifp and ofp are tile pointers. More explicitly, they are of type pointer to 
FILE. The program is designed to access two files entered as command line arguments. 
If there are too few’ or too many command line arguments, prn_info() is invoked to 



g® 




■ fclose(ifp); 
fclose(ofp); 

The function f cl ose () from the standard library is used to close the files pointed to by 
ifp and ofp. It is good programming style to close files explicitly in the same function 
in which they were opened. Any files not explicitly closed by the programmer are closed 
automatically by the system on program exit. 

■ void double_space(FILE -'ifp, FILE *ofp) 

{ 

int c; 

The identifier c is an i nt. Although it will be used to store characters obtained from a 
file, eventually it will be assigned the value EOF, which is not a character. 


while ((c = getc(ifp)) 
putc(c, ofp); 
if ( C == ’\n') 

putc('\n', ofp); 

} 


EOF) { 


/* found a newline - duplicate it */ 


The macro getcQ reads a character from the file pointed to by ifp and assigns the 
value to c. If the value of c is not. EOF, then putc() is used to write c into the file 
pointed to by ofp. If c is a newline character, another newline character is written into 
the file as well, double spacing the output file. This process continues repeatedly until 
an EOF is encountered. The macros getcQ and putcQ are defined in stdio.h. 


§mm 
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11.6 Using Temporary Files and Graceful Functions 




In ANSI C, the programmer can invoke the library function tmpfi 1 e() to create a tein-. (IS 
porary binary file that will be removed when it is closed or on program exit. The file is -ffl 
opened for updating with the mode "wb+". In MS-DOS, a binary file can also be used as 
a text file. In UNIX, binary and text files are the same. In this section, we write an ele¬ 
mentary program that illustrates the use of tmpfi le() and a graceful version ofttii 
fopenQ. lllijj 

The name of our program is dbl_with_caps. First, it reads the contents of a file into a 
temporary file, capitalizing any letters as it does so. Then the program adds the con- . < 
tents of the temporary file to the bottom of the first file. igS 
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FILE *gfopen(char *filename, char *mode) 

{ 

FILE *fp; 

if ((fp = fopen(filename, mode)) == NULL) { 

fprintf(stderr, "Cannot open %s - bye!\n", filename) 
exit(l); 

} 

return fp; 




n '^:fg! 
■mm 


In file dbl_with_caps.c 

/* Replicate a file with caps. */ 

#include <ctype.h> 

#include <stdio.h> 

#include <stdlib.h> 

FILE *gfopen(char *filename, char *mode); 

int main(int argc, char **argv) 

{ 

int c; 

FILE *fp, *tmp_fp; 
if (argc != 2) { 

fprintf(stderr, "\n%s%s%s\n\n%s\n\n", 

"Usage: ", argv[0], " filename", 

"The file will be doubled and some letters capitalized."); 
exit(l); 

} 

fp = gfopen(argv[l], "r+"); 
tmp_fp = tmpfi1e(); 
while ((c = getc(fp)) != EOF) 
putc(toupper(c), tmp_fp); 
rewind(tmp_fp); 

fprintf(fp, "-\n"); 

while ((c = getc(tmp„fp)) != EOF) 
putc(c, fp); 
return 0; 


#1 












mi Before we explain the program, let us see its effects. Suppose that in file apple we have 


the line 








mm mm 
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A is for apple and alphabet pie. 
After we give the command 

dbLwith_caps apple 
the contents of the file will be 

A is for apple and alphabet pie. 
A IS FOR APPLE AND ALPHABET PIE. 


SigipfW 


Dissection of the dbLwith_caps Program 

■ fp = gfopen(argv[l], "r+"); 

We are using a graceful version of fopen () to open a file for both reading and writing. 
If for some reason the file cannot be opened, a message will be printed and the program 
exited. 

■ tmp_fp = tmpfi leQ; 

ANSI C provides the function tmpfi le() to open a temporary file. The file mode is 
"wb+". On program exit, the file will be removed by the system. (See Section A.12, 
“Input/Output: <stdi o, h>,” on page 656, for the function prototype and other details.) 
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■ while ((c = getc(fp)) != EOF) 
putc(toupper(c), tmp_fp); 

The macros getcQ and putc() are defined in stdio.h. They are being used to read from 
one file and to write to another. The function prototype for toupperQ is given ini 
ctype.h. If c is a lowercase letter, toupper(c) returns the corresponding uppercase let 
ter; otherwise, it returns c. Caution: Some ANSI C compilers do not get this right; w|j| 
hope that they will improve with time. (See exercise 12, on page 400, in Chapter 8, “The 
Preprocessor,” for further discussion.) You may have to write . J 


while C(c = getc(fp)) != EOF) 
if (islower(c)) 

putc(toupper(c), tmp_fp); 

’‘I 

e I se 

putcfc, tmp_fp); 







m 








. 




■ rewind (tmp_fp); 

This causes the file position indicator for the stream pointed to by tinp_f p to be set to 
the beginning of the file. This statement is equivalent to 


ill 




ll®N 




■ fseek(tmp_fp, 0, 0); 

See Section A.12, “Input/Output: <stdi o. h>,” on page 657, for the function prototypes 
and for an explanation of fseekQ. 


mmm 






mm 
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fprintf(fp, "—\n"); 
while ((c = getc(tmp_fp)) 
putc(c, fp); 


EOF) 


Now we are reading from the stream pointed to by tmp_fp and writing to the streams* 
pointed to by fp. Note that a call to rewi nd() occurred before the switch from writing 
to reading on the stream pointed to by tmp_f p. f: 

H FILE *gfopen(char *filename, char *mode) 

{ 




..;T 







* * A k 


This is a graceful version of fopen(). If something goes wrong, a message is printed on gig 
the screen and we exit the program. Note that we wrote to stderr. In this program, we 
could just as well have written to stdout. However, in other programs that use this|§| 
function, there is an advantage to writing to stderr. (See exercise 1, on page 542.) 
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. 

11.7 Accessing a File Randomly 

The library functions fseek() and ftel 1 () are used to access a file randomly. An 
expression of the form 

ftel 1 (file_ptr) 

returns the current value of the file position indicator. The value represents the number 
of bytes from the beginning of the file, counting from zero. Whenever a character is 
read from the file, the system increments the position indicator by 1. Technically, the 
file position indicator is a member of the structure pointed to by file_ptr. Caution: The 
file pointer itself does not point to individual characters in the stream. This is a concep¬ 
tual mistake that many beginning programmers make. 

The function fseek() takes three arguments: a file pointer, an integer offset:, and an 
integer that indicates the place in the file from which the offset should be computed. A 
statement of the form 

fseek(.fiie_ptr , offset, place); 

sets the file position indicator to a value that represents offset bytes from place. The 
value for place can be 0, 1, or 2, meaning the beginning of the file, the current position, 
or the end of the file, respectively. Caution: The functions fseekO and ftel 1 () are 
guaranteed to work properly only on binary files. In MS-DOS, if we Wcint to use these 
functions, the file should be opened with a binary mode. In UNIX, any file mode will 
work. 

A common exercise is to write a file backwards. In the following program, the prompt 
to the user is written to stderr so that the program will work with redirection. (See 
exercise 1, on page 542.) We opened the file with mode "rb" so that the program will 
work in both MS-DOS and UNIX. 

Here is a program that writes a file backwards: 
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In file backward.c 


/* Write a file backwards. */ 


find ude <stdio.h> 
#define MAXSTRING 


int main(void) 

{ 

char fname[MAXSTRING]; 
int c; 

FILE *ifp; 

fprintf(stderr, "\nlnput a filename 
scanf("%s", fname); 
ifp = fopen(fname, "rb"); /* b 

fseek(ifp, 0, SEEK_END); /* m 

fseek(ifp, -1, SEEK_CUR); /* b 

while (ftell(ifp) > 0) { 

c = getc(ifp); /* m 

putchar(c); 

fseek(ifp, -2, SEEK_CUR); /* b 

} 

return 0; 


"); 


/* binary mode for MS_DOS */ 

/* move to end of the file */ 
/* back up one character */ 

/* move ahead one character */ 

/* back up two characters */ 


11.8 File Descriptor Input/Output 

A file descriptor is a nonnegative integer associated with a file. In this section, we 
describe a set of library functions that are used with file descriptors. Although these 
functions are not part of ANSI C, they are available on most C systems in both MS-DOS 
and UNIX. Because of minor differences, care is needed when porting code from UNIX 
to MS-DOS or vice versa. 


File name Associated file descriptor 

standard input 0 

standard output 1 

standard error 2 
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Functions in the standard library that use a pointer to FILE are usually buffered. In 
contrast, functions that use file descriptors may require programmer-specified buffers. 
Let us illustrate the use of file descriptors with a program that reads from one file and 
writes to another, changing the case of each letter. 

n file change_case.c 

/* Change the case of letters in a file. */ 

#include <ctype.h> 

#include <fcntl.h> 

#include <unistd.h> /* use io.h in MS_DOS */ 

#define BUFSIZE 1024 

int main(int argc, char **argv) 

{ 

char mybuf[BUFSIZE], w p; 
int in_fd, out_fd, n; 


in_fd = open(argv[l], O.RDONLY); 

out_fd = open(argv[2], 0_WR0NLY | 0_EXCL | 0_CREAT, 0600); 
while ((n = read(in_fd, mybuf, BUFSIZE)) > 0) { 
for (p = mybuf; p - mybuf < n; ++p) 
if (islower(*p)) 

*p = toupper(*p); 
else if (isupper(*p)) 

*p = tolower(*p); % 
write(out_fd, mybuf, n); 

} 

close(in_fd); 
close(out_fd); 
return 0; 






Dissection of the change_case Program 


#include <ctype.h> 
#include <fcntl.h> 
#include <unistd.h> 


/* use io.h in MS-DOS */ 


The header file fcntl.h contains symbolic constants that we are going to use. The header 
file unistd.h contains the function prototypes for openQ and read(). In MS-DOS, we 
would include io.h instead. 
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in_fd = open(argv[l], 0_RD0NLY) 


The first argument to openQ is a file name; the second argument specifies how the file 
is to be opened. If there are no errors, the function returns a file descriptor; otherwise, 
the value -1 is returned. The identifier in_fd is mnemonic for “in file descriptor." On 
both MS-DOS and UNIX systems, the symbolic constant CLRDONLY is given in fcntl.h. It is 
mnemonic for “open for reading only.” 


out_fd = open(argv[2], CLWRONLY | 0_EXCL | 0_CREAT, 0600); 


Lhe symbolic constants in fcntl.h that are used to open a file can be combined with the 
bitwise OR operator. Here, we are specifying that the file is to be opened for writing 
only, that the file is to be opened exclusively (meaning that it is an error if the file 
already exists), and that the file is to be created if it does not exist. CLEXCL gets used 
only with 0_CREAT. If the file is created, then the third argument sets the file permis¬ 
sions; otherwise, the argument has no effect. We will explain about file permissions 
below. 


while C(n = read(in_fd, mybuf, BUFSIZE)) > 0) { 


A maximum of BUFSIZE characters are read from the stream associated with i n_fd and 
put into mybuf. The number of characters read is returned. The body of the whi 1 e loop 
is executed as long as read() is able to get characters from the stream. In the body of 
the loop the letters in mybuf are changed to the opposite case. 


write(out_fd, mybuf, n); 


n characters in mybuf are written to the stream indicated by out_fd. 


close(in_fd); 
close(out_fd) 


This closes the two files. If the files are not explicitly closed by the programmer, the 
system will close them on program exit. 



■ 



C aution: This program is not user-friendly. If you give the command 


change_case file1 file2 


and file'2 already exists, then the file does not get overwritten, which is the behavior we 
wanted. A better-designed program would say something to the user in this case. 
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11.9 File Access Permissions 

In UNIX, a file is created with associated access permissions. The permissions deter¬ 
mine access to the file for the owner, the group, and for others. The access can be read, 
write, execute, or any combination of these, including none. When a file is created by 
invoking open(), a three-digit octal integer can be used as the third argument to set the 
permissions. Each octal digit controls read, write, and execute permissions. The first 
octal digit controls permissions for the user, the second octal digit controls permis¬ 
sions for the group, and the third octal digit controls permissions for others. (“Others” 
includes everybody.) 


Meaning of each octal digit in the file permissions 

Mnemonic 

Bit representation 

Octal representation 

r— 

100 

04 


010 « 

02 

—X 

001 

01 

rw~ 

110 

06 

r~x 

101 

05 ! 

-wx 

Oil 

03 

rwx 

111 

07 


Now, if we pack three octal digits together into one number, we get the file access per¬ 
missions. The mnemonic representation is the easiest to remember. The first, second, 
and third group of three letters refers to the user, the group, and others, respectively. 


Examples of file access permissions 

Mnemonic 

Octal representation 


0600 


0604 

rwxr-xr-x 

0755 

rwxrwxrwx 

0777 

--—---- 
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The permissions rwxr-xr-x mean that the owner can read, write, and execute the file; 
that the group can read and execute the file; and that others can read and execute the 
file. In UNIX, the mnemonic file access permissions are displayed with the Is ~/ com¬ 
mand. In MS-DOS, file permissions exist, but only for everybody. 


11.10 Executing Commands from Within a C Program 


The library function system() provides access to operating system commands. In both 
MS-DOS and UNIX, the command date causes the current date to be printed on the 
screen. If we want this information printed on the screen from within a program, we 
can write 


system("date"); 


The string passed to system() is treated as an operating system command. When the 
statement is executed, control is passed to the operating system, the command is exe¬ 
cuted, and then control is passed back to the program. 

In UNIX, vi is a commonly used text editor. Suppose that from inside a program we 
want to use vi to edit a file that has been given as a command line argument. We can 
write 


char command[MAXSTRING]; 


sprintf(command, "vi %s", argv[l]); 

printf("vi on the file %s is coming up .,.\n", argv[l]); 
system(command); 


A similar example works in MS-DOS, provided we replace vi by an editor that is avail¬ 
able on that system. 

As a final example, let us suppose we are tired of looking at all those capital letters 
produced by the dir command on our MS-DOS system. We can write a program that 
interfaces with this command and writes only lowercase letters on the screen. 
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hi file lower_case.c 

/it ignite only lowercase on "the screen. / 

#include <ctype.h> 

#include <stdio.h> 

#include <stdlib.h> 

#define MAXSTRING 100 

int main (void) 

{ char command[MAXSTRING], *tmp_filename; 

int c; 

FILE *ifp; 

tmp_fi1ename = tmpnam(NULL); 

sprintf(command, "dir > %s" , tmp_fi1ename); 

system(command); 

ifp = fopen(tmp_filename, r ); 

while ((c = getc(ifp)) != EOF) 

putchar(tolower(c)); 
remove(tmp_filename); 
return 0; 

} 

First we use the library function tmpnamQ to create a temporary tile name. Then we 
invoke system () to redirect the output of the dir command into the temporary file. 
Then we print on the screen the contents of the file, changing each uppercase letter to 
lowercase. Finally, when we are finished using the temporary iile, we invoke the library 
function remove () to remove it. (See Section A. 12, “Input/Output: <stdio.h>,” on page 
662, for details about these functions.) 
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11.11 Using Pipes from Within a C Program 


v ■■ 


UNIX systems provide popen() and pcloseC) to communicate with the operating sys- 
tern. These functions are not available in MS-DOS systems. Suppose we are tired of look- 1 
mg at all those lowercase letters produced by the Is command on our UNIX system. 




In file uppercase.c 


■'■TTU 






^include <ctype.h> 
#include <stdio.h> 






mtm 


int main(void) 

{ 

int c; 
FILE *ifp 


>.-xV rjf}' 


} 


ifp = popen("ls", "r"); 
while ((c = getc(ifp)) != EOF) 
putchar(toupper(c)); 
pclosefifp); 
return 0; 










The first argument to popenQ is a string that is interpreted as a command to the open 

ating system; the second argument is a file mode, either "r” or "w". When the function 

is invoked, it creates a pipe (hence the name popen) between the calling environment 

and the system command that is executed. Here, we get access to whatever is produced 

by the Is command. Because access to the stream pointed to by i fp is via a pipe, we 

cannot use file positioning functions. For example, rewind (ifp) will not work. We can 

onl y access the characters sequentially. A stream opened by popen () should be closed 

by pcloseC). If the stream is not closed explicitly, then it will be closed by the system 
on program exit. 
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11.12 Environment Variables 

Environment variables are available in both UNIX and MS-DOS. The following program 
can be used to print them on the screen: 

#include <stdio.h> 

int main(int argc, char *argv[], char *env[]) 

{ 

int i; 

for (i =0; env[i] != NULL; ++i) 
printf("%s\n", env[i]); 
return 0; 

> 

The third argument to tnain() is a pointer to pointer to char, which we can think of as 
an array of pointers to char, or as an array of strings. The system provides the strings, 
including the space for them. The last element in the array env is a NULL pointer. On 
our UNIX system, this program prints 

H0ME=/c/c/blufox/center_manifol d 
SHELL=/bin/csh 
TERM=vtl02 
USER=blufox 


To the left of the equal sign is the environment variable; to the right of the equal sign is 
its value, which should be thought of as a string. On our MS-DOS system, this prograny 

prints / 

! 

C0MSPEC=C:\C0MMAND.COM 
BASE=d:\base 
INCLUDE=d:\msc\include 


The UNTX system provides a command to display the environment variables, but th.e 
command depends on which shell you are running. In the C shell, the command is 
printenv, whereas in the Bourne shell and in MS-DOS, the command is set. The output of 
the command is the same as the output of our program. 
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By convention, environment variables are usually capitalized. In a C program, we can % 
use the library lunction getenvQ to access the value of an environment variable* 
passed as an argument. Here is an example of how getenvQ can be used: 


Mrff.'t'. "A- 

14 . * ■. . :-.^3 

llllipipsfer. 

smimmsi ;• • ,.;ji 

5 ^--vr - vr ? 


Pirn if 

. - ii 1 * 

:g r , 

■*;:v « 


printf("%s%s\n%s%s\n%s%s\n%s%s\n", 

[[ Name: ", getenv("NAME") , 

" User: ", getenv("USER"), 

Shell: ", getenv("SHELL"), 

'Home directory: ", getenv("HOME")); 

The function prototype is provided in stdlib.h. If the string passed as an argument is 
not an environment variable, the NULL pointer is returned. 
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Command 


cc 

acc 

“be 

bcc 

gcc 

he 

occ 

qc 

tc 

tcc 


The C compiler that gets invoked 


The system supplied native C compiler 

• i ____ 

An early version of an ANSI C compiler from Sun Microsystems 
Borland C/C ++ compiler, integrated environment 
Borland C/C++ compiler, command line version 
GNU C com piler from the Free Software Foundation 
High C compiler from Metaware 
pregon C compiler from Oregon Software 
Quick C compiler from Microsoft ~ ~ 

/Turbo C compiler, integrated environment, from Borland 
Turbo C compiler, command line version, from Borland 


In t , 1 ^f Iv section ’ wo discuss some of the options that can be used with the cc command 
on UNIX systems. Other compilers provide similar options. 

If a complete program is contained in a single file, say pgm.c, then the command 
cc pgm.c 




11.13 The C Compiler 

There are many C compilers, and an operating system may provide any number of 
them. Here are just a few of the possibilities: 
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translates the C code in pgm.c into executable object code and writes it in the file a.out. 
(In MS-DOS, the executable file is pgm.exe.) The command a.out executes the program. 
The next cc command will overwrite whatever is in a.out. if we give the command 

cc -o pgm pgm.c 

:: then the executable code will, instead, be written directly into the file pgm, whatever is 
I in a.out will not be disturbed. 

| The cc command actually does its work in three stages: First, the preprocessor is 
I invoked, then the compiler, and finally the loader. The loader, or linker, is what puts all 
the pieces together to make the final executable file. The -c option can be used to eom- 
pile only—that is, to invoke the preprocessor and the compiler, but not the loader. This 
| is useful if we have a program written in more than one file. Consider the command 

| cc -c main.c filel.c file2.c 

If there are no errors, corresponding object files ending in .o will be created. To create 
an executable file, we can compile a mixture of .c and .o files. Suppose, for example, 
that we have an error in main.c. We can correct the error in main.c and then give the 
command 

cc -o pgm main.c filel.o file2.o 
The use of .o files in place of .c files reduces compilation time, 


c 


g 


-o name 


P 


v 


D name=def 



I dir 


-M 


Some useful options to the compiler 


Compile only, generate corresponding ,o files. 


Generate code suitable for the debugger. 


Put executable output code in name. 


Generate code suitable for the profiler. 


Verbose option, generates a lot of information. 


Place at the top of each .c file the line. 


#define name def. 


Invoke the preprocessor but not the compiler. 


Look for #i ncl ude files in the directory dir. 


Make a makefile. 



-0 

p—-—--- 

i Attempt code optimization. 

-s 

Generate assembler code in corresponding .s files. 
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mm 


may use different flag characters. Compilers in MS-DOS usually support different 


Your compiler may not support all of these options, and it may provide others. OiftfS 


ory models. Consult the documentation for your compiler for a detailed list of options. 
Suggestion: If you have never tried the -v option, do so. Some compilers produce a tofllj 
of information, and this information can be very useful when you are trying to under-: 
stand all the details about the compilation process. 


11.14 Using the Profiler 


In UNIX, the -p option used with cc causes the compiler to produce code that counts th|J 
number of times each routine is called. If the compiler creates an executable file, then It 
is arranged so that the library function moni tor () is automatically invoked and a file 


mon.out is created. The file man. out is then used by the prof command to generate an 

Jg 

execution profile. 


As an example of how this all works, suppose we want an execution profile of the 


quicksort routine that we wrote In Section 8.5, “An Example: Sorting with qsort() 
page 372. Here is our function mai n(): 


In file main.c 


#include <stdio.h> 
#include <stdlib.h> 
#include <time.h> 


#define N 


50000 


void quicksort(int *, int *) 
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int main(void) 

i nt a[N] , i ; 
srand (time (NULL)); 

for (i =0; i < N; ++i) 
a[i] => randQ % 10000; 
quicksorts, a + N - 1); 
for(i=0;i < N-l; ++i) 
if (a[i] > a[i +1]) f 
printf ("SORTING ERROR 
exit(l); 

} 

return 0; 


bye!\n"); 


> 

To obtain an execution profile of our program, we first compile it with the -p option: 

cc -p "o quicksort twain-c quicksorts 
Next, we give the command 
quicksort 

This causes the file mon.out to be created. Finally, to get an execution profile, we give 
the command 

prof quicksort 

This causes the following to be printed on the screen. 


cumsecs 
7.18 
9.64 
11.43 
13.08 
14.13 
15.12 
15.33 
15.33 
15.33 
15.33 
15.33 

named functions are user-defined; some of them, such as _gettimeofday, 
routines. An execution profile such as this can be very useful when working 

execution time efficiency. 


ol i me 
46.9 
16.1 

11.7 

10.8 

6.9 

6.4 

1.4 
0.0 
0.0 
0.0 
0.0 

Not all the 
are system 
to improve 


#call 

ms/call 

name 

9931 

0.72 

_partition 

1 

2460.83 

_mai n 

19863 

0.09 

_find_pivot 
mcount 

50000 

0.02 

_rand 

19863 

0.05 

_quicksort 
_monstartup 

1 

0.00 

_gettimeofday 

1 

0.00 

_profi) 

1 

0.00 

_srand 

1 

0.00 

_time 
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11.15 Libraries 



Many operating systems provide a utility to create and manage libraries. In UNIX, the 
utility is called the archiver, and it is invoked with the ar command. In the MS-Ddg, 
world, this utility is called the librarian, and it is an add-on feature. The Microsoft : 
librarian, for example, is lib, whereas the Turbo C librarian is tlib. By convention, library's 
files end in .a in UNIX, and end in .lib in MS-DOS. We will discuss the situation as it neillS 
tains to UNIX, but the general ideas apply to any librarian. 

In UNIX, the standard C library is usually in the file /lib/libc.a, but the standard! 
library can exist wholly or in part in other files as well. If UNIX is available to you, f Mftll 
the command 


... 


IKS- 


mm 1 ■ 

i 


sgav- .v-;i 


m • 

jH#Ki; 


ar t /usr/Iih/IibcM 


mm 


The key f is used to display titles, or names, of files in the library. There are more titles'-S 
than you care to look at. To count them, you can give the command 




ar t /usr/lib/libc.a / wc 


!1#Slr 


•S-l mm 

i sJUMig 


' ' 


This pipes the output of the ar command to the input of the wc command, causinS 
lines, words, and characters to be counted. (The name wc stands for “word count.”) It is E! 
not too surprising that the standard library grows with time. On a DEC VAX 11/780 lU 
from the 1980s, the standard library contained 311 object files. On a Sun machine that/llHf 
was relatively new in 1990, the standard library contained 498 object files. On the Sun 
machine that we happen to be using today, the number of object files is 563. $ 


ifcfl 

mmm 


mm 
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Let us illustrate how programmers can create and use libraries of their own. We will 


Wmm 




is.. 


do this in the context of creating a “graceful library." In Section 11.6, “Using Temporar| , :i» 
Files and Graceful Functions," on page 511, we presented gfopenQ, a graceful versio3|SS 
ol fopen (). In a directory named gjib, we have 11 such graceful functions, and we con-^| 
tinue to add more from time to time. These are functions such as gfcloseQ, gcal/ljj 
loc(), gmallocQ, and so forth. Each is written in a separate file, but for the purpose:® 




US: 


ms 




ipii 

mm 


of building a library, they could just as well be in one file. To reinforce the idea of these 1 . J§ 
functions, we give the code for gcal 1 ocQ: 






ivV : ! 






HISS; 


: V- 




wy- 


.... ..W 


■■.sm 


W.'i; 
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#include <stdio.h> 

#include <stdlib.h> 

void ,v gcalloc(int n, unsigned sizeof„something) 

{ 

void *p; 

if ((p = callocfn, sizeof_something)) == NULL) { 

fprintffstderr, "\nERR0R: callocC) failed - bye.\n\n"); 
exit(l); 

} 

return p; 

} 

To create our library, we must first compile the .c files to obtain corresponding .o files. 
After we have done this, we give the two commands 

ar ruv gjib.a gfopen.o gfdose.o gcaiioc.o ... 
ranlib gjib.a 

The keys ruv in the first command stand for replace, update, and verbose, respectively. 
This command causes the library gjib.a to be created if it does not already exist. If it 
does exist, then the named ,o files replace those of the same name already in the 
library. If any of the named .o files are not in the library, they are added to it. The ranlib 
command is used to randomize the library in a form that is useful for the loader. 

Now suppose we are writing a program that consists of main.c and two other .c files. 
If our program invokes gfopenO, we need to make our library available to the com¬ 
piler. The following command does this: 

cc -o pgm main.c filel.c file2.c gjib.a 

If our program invokes a function and does not supply its function definition, then it 
will be searched for, first in gjib.a and then in the standard library. Only those func¬ 
tions that are needed will be loaded into the final executable file. 

If we write lots of programs, each consisting of many files, then each of the programs 
should be written in its own directory. Also, w'e should have a separate directory for our 
libraries such as gjib.a, and anolher directory for associated header files such as 
gjib.h. For each function in gjib.a, wc put its prototype in gjib.h. This header file then 
gets included where needed. To manage all of this, we use the make utility. (See Section 
11.17, “The Use of make,” on page 532.) 
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1.16 How to Time C Code 

Most operating systems provide access to the underlying machine’s internal clock. In|j 
this section, we show how to use some timing functions. Because our functions are 
meant to be used in many programs, we put them into the library u_.lib.ci, our utility 
library. In Section 11.17, “The Use of make,” on page 532, we will discuss a progranil 
that uses functions from both of our libraries gjib.a and uJib.a. 

Access to the machine’s clock is made available in ANSI C through a number of funcjl 
tions whose prototypes are in time.h. This header file also contains a number of other! 
constructs, including the type definitions for c1ock_t and ti me_t, which are useful for § 
dealing with time. Typically, the two type definitions are given by 


typedef 

typedef 


long 

long 


clock_t 
time_t; 


..wm 




and in turn, these types are used in the function prototypes. Here are the prototypes 
for the three functions we will use in our timing routines: 


clock_t 

time_t 

double 


clock(void); 
time(time_t *p); 

difftime(time_t timel, time_t time0); 


. 

--S:; tmm 

■■ Mii. 
, 


m 


When a program is executed, the operating system keeps track of the processor time 1 
that is being used. When clockO is invoked, the value returned is the system’s bestjjjj 
approximation to the time used by the program up to that point. The clock units can! 
vary from one machine to another. The macro 


fdefine CLOCKS_PER_SEC 60 


/* machine-dependent */ 






plgi 


stt 


is provided in time.h. It can be used to convert the value returned by clockO to sec- fjjjp 
onds. Caution: In preliminary versions of ANSI C, this macro was called CLI<_TCK. 

The function timeO returns the number of seconds that have elapsed since 1 Janu¬ 
ary 1970, Other units and other starting dates are possible, but these are the ones typi¬ 
cally used. If the pointer argument passed to time() is not NULL, then the value 
returned gets assigned to the variable pointed to as well. One typical use is 

srand(time(NULL)); 

This seeds the random-number generator. If two values produced by time() are passed 
to difftimeC), the difference expressed in seconds is returned as a double. 
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w r t0 p re sent a set of timing routines that can be used for many purposes. 
We want to present a we keep these functions m the lile 

including the development ol efhcient code, we Keep 

time-keeper.c. 

#include <stdio.h> 

#include <stdlib.h> 

#include <time.h> 

#define MAXSTRING 100 
typedef struct { 

cl ock_t begi n_clock, save_cioci<., 
time_t begin„time, save_time, 

} time_keeper; 

. , +. 1 ,. /* known only to this file "/ 

static time_keeper tk; / y 

void start_time(void) 

^ tu heain clock = tk.save_clock = clockO; 
tk.begin„cioci ^ save _ t ime = time(NULL); 




'HWZC: 


A 


c 


m 




■mM 


tk.begin_time 


} 


double prn_time(void) 

{ char si[MAXSTRING], s2[MAXSTRING]; 

int field_width, nl, n2; nr „, pen c F r 

double clocks_per_second = (double) CLOCKS_PER_S , 

user_time, real_time; 

user time = (clockO - tk.save_clock) / clocks per.second; 
realZtime = difftime(time(NULL), tk.save_time), 
tk.save_clock = clock0; 
tk.save_time = time(NULL), 

/* print the values found, and do it neatly */ 


nl 


} 


nx = sprintf(si, "%.lf", user.time); 
n2 = sprintf(s2, 1f %*If” 5 real—time) * 
field_width = (nl > n2) ? nl : n2; 

nrintf("%s%*.lf%s\n%s%*.lf/(s\n\n , 

"User time: ", field_width, user.time, 
"Real time: ", field_width, real_time, 
return user_time; 


M 


seconds", 
seconds"); 


!®i 
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Note that the structure tk is external to the functions and is known only in this file. It is 
used for communication between the functions. When start_time() is invoiced, the I 
values returned from clocl<() and time() are stored in tk. When prn_time() is J| 
invoked, new values from clock() and time() are used to compute and print theHl 
elapsed user time and the elapsed real time, and new values are stored in tk. User time H 
is whatever the system allocates to the running of the program; real time is wall-clock 
time. In a time-shared system they need not be the same. ;« 

The function prn_total_time() is also in the file, but is not shown. It is similar to 3 
prn_time(), except that the elapsed times are computed relative to the last invocation S§ 
of start__ti me () rather than the last invocation of any of the three functions. « 

Because our timing routines are meant to be used in a variety of programs, we put IS 
them into ujib.a , our utility library. The following commands do this: J§ 


cc -c time_keeper.c; 

ar ruv ujib.a time_keeper.o; 

ranlib ujib.a 


In the header file ujib.h , we put the prototypes of the functions in ujib.a. The header 
file then gets included elsewhere as needed. 

Now we demonstrate how our timing routines can be used. In certain applications, 
fast floating-point multiplication is desired. Should we use variables of type float or 
doubl e? The following program can be used to test this: 

In file mult_time.c 


IRfesr-.j 


> 

3 

i 

life' ■■ 

•I 


/* Compare float and double multiplication times. */ 


#include <stdio,h> 
#include ”u_lib.h” 


#define 


100000000 


/* one hundred million */ 


* i 


•I 


iflllMp 

V fMW 


s' - 

. . . • 
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int main (void) 

{ 

long i; 

float a, b = 3.333, c = 5.555; /* arbitrary values */ 

double x, y = 3.333, z = 5.555; 

printf(’’Number of multiplies: %d\n\n”, N) ; 
printf("Type float:\n\n”); 
start_time(); 
for (i =0; i < N; ++i) 
a = b * c; 
prn„time(); 

printf( M Type double:\n\n”); 
for (i =0; i < N; ++i) 
x = y * z; 
prn_time(); 
return 0; 

} 

On an older machine with a traditional C compiler, we find, much to our surprise, that 
single-precision multiplication is slower than double-precision multiplication! In tradi¬ 
tional C, any float is automatically promoted to a doubl e. Perhaps the results are due 
to the time it takes the machine to do the conversion. On another machine with an ANSI 
C compiler, we find that single-precision multiplication is about 30 percent faster. This 
is in line with what we expected. When we try the program on a Sun Sparcstation 10, we 
obtained the following results: 

Number of multiplies: 100000000 
Type float: 

User time: 33.6 seconds 
Real time: 34.0 seconds 
Type double: 

User time: 33.5 seconds 
Real time: 33.0 seconds 

Once again we are surprised! We expected that multiplication with f 1 oats would yield a 
30 percent saving in time, but that is not the case. Note that the real time listed for 
doubl es is less than the user time. This happens because the clocking mechanism as a 
whole is only approximate. Caution: To get an accurate measure of machine multiplica¬ 
tion time, the overhead of the for loops themselves, as well as general program over¬ 
head (relatively negligible), has to be taken into account. 
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mem access to libraries and their associated header files This f w 9sM 

available in UNIX and often ayailable in MS-DOS, where'i, N an add ™ 

greatly facilitates both the construction and the maintPrvmrV J dd feature - J ts use 
Let us suppose we ire whh„ 1 maintenance of programs. |;fjl 

Tvniraiitr PP , writm 8 a program that consists of a number of h w MlS 
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tea s a file whose default name is makefile This file contains the a & ^ & COmraand 
various modules, or files makin,, n, ' 1S , contams ,hc dependencies of the- 1 

taken, in partial ar it contMnsT Pr ° gr ™' along ™ th appropriate actions to Wk 
gram. Sue? a ffle "cal.ed TZlmT * *" ” re “ mp ‘“" g . «« Pro 

P * 1 ™ have a Ptngfam contained in t„„ files, ,M 

We want the executable code foft? ' ^ is induded <» «•* of the .c S i 

makefile that can “u ed“o, D ” ™ fT* "* *• »« * a staplll 

GQ 101 pi ogram development and maintenance: M 

\ • • 


H>»S5S8ts®' x .;;fS#sj5t8|^pSS 





The second rule in our makefile states that main.o depends on the two files main.c 
n d sum.h. If either of these two files is changed, then the action line shows what must 
e done to update main.o. After this makefile has been created, the programmer can 
compile or recompile the program sum by giving the command 


HIT Sk 



H- make 

With this command, make reads the file makefile , creates for itself a dependency tree, 
feid takes whatever action is necessary. 

The dependency tree used internally by make 






main.o 


sum.o 




main.c 


sum.h 


sum.c 


sum. h 


; -/c.- v 


sum. main.o sum.o Tgig||{ 

main n CC _< ? SUm mai ’ n • 0 SUm.O '1 

main.o: main.c sum.h lift 

cc -c main.c : 'S2A 

sum.o: sum.c sum.h tl^ra 

cc -C sum.c Mi 

- j I 

MMis anSmpk o^a'Mnd' *7 dt ' P ™ ds °” the two “>»« files mai n.o and® 

indicates how the wogram iff * 1,1 “ h ‘™ 1. The second taj*l 

changed. I, is C1 , a f“Z 1' C ° mP " td “ «“ « of the .„ files have been "l 

following a dependency line " depeldenlvT" ,T* ^ bCmore ' han «* *«» ' 1 
up what is called a rule Canting- c nCy me and 1he action Hues that follow it make 

the screen, a lab chaMer ',o kMMlZ a tab character. On 31 
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B)' deiault, the make . -vf-.t 

But dependent files in tha™ e L™ h Me, ?f,? k linds ™ 'he makefile. 
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Certain rules are built into make , including the rule that a ,o file depends on the cor¬ 
responding x file. Because of this, an equivalent makefile is given by 

sum: main.o sum.o 
€ cc —0 sum main.o sum.o 

main.o: sum.h 

cc -c main.c 
sum.o: sum.h 

cc -c sum.c 

The make utility recognizes a number of built-in macros. Using one of them, we get yet 
another equivalent makefile: 

sum: main.o sum.o 

cc -o sum main.o sum.o 
main.o sum.o: sum.h 
cc -c $*.c 

Here, the second rule states that the two .o files depend on sum.h. if we edit sum.h , then 
both main.o and sum.o must be remade. The macro $* . c expands to mai n . c when 
main.o is being made, and it expands to sum.c when sum.o is being made. 
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A makefile consists of a series of entries called rules that specify dependencies and 
actions. A rule begins in column 1 with a series of blank-separated target files, followed 
by a colon, followed by a blank-separated series of prerequisite files, also called source 
files. All the lines beginning with a tab that follow this are the actions-—such as compK 
lation—to be taken by the system to update the target files. The target files are depen¬ 
dent in some way on the prerequisite files and must be updated when the prerequisite] 
files are modified. 

In exercise 33, on page 406 in Chapter 8, “The Preprocessor,” we suggested that a! 
serious comparison of qsort() and qui cksortQ be undertaken. We wrote a program 
to do this that compared the running times for three sorting routines: our own quick- 
sortQ, the system-supplied qsortQ, and a qsortQ from another system for which 
we were able to borrow the code. The borrowed qsortQ consisted of some 200 lines of 
code in a single file. Including this file, our program consisted of approximately 400 
lines written in five files. Here is the makefile that we used: 
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r Act 


/home/blufox/base 


Hi 


'fids is an example of a macro definition. The general form of a macro definition is 


T macro_name = replacement .string 

1 By convention, macro names are usually capitalized, but they do not have to be. The 
t replacement string can contain white space. If a backslash \ occurs at the end of the 
line, then the replacement string continues to the next line. Our home directory on this 
d|particular machine is /home/bl ufox. We created the subdirectory base to hold all our 
«S other major subdirectories. We think of it as our “base of operation.” The macro BASE 
M refers to this subdirectory. 




mBm 


cc 

fFI &CX 

Vw*»- I I mm f V 


gcc 

-0 -Wall 


In file makefile 

# Makefile to compare sorting routines. 

BASE = /home/blufox/base 

CC = gcc 

CFLAGS = -0 -Wall 

EFILE = $(BASE)/bin/compare_sorts 

INCLS = -I$(L0C)/include 

LIBS = $(L0C)/1 ib/g_lib,a \ 

$(L0C)/Iib/u_lib.a 
L0C = /usr/local 

OBIS = main.o another„qsort.o 

compare.o quicksort.o 
$(EFILE): $(0BIS) 

©echo "linking ..." 

®$(CC) $ (CFLAGS) -o $(EFILE) $ 

$(0BIS): compare_sorts.h 

$(CC) $(CFLAGS) $ (INCLS) -c $ 


chkorder.o 


$(0B1S) $(LIBS) 
$*.c 


sss§§8l 


The first macro specifies the C compiler that we are using, in this case the GNU C com- 
| piler. The second macro specifies the options, or “C flags,” that will be used when gcc 
gets invoked. The -O option turns on the optimizer; the -Wall option asks for all warn- 
I mgs. If we had written 


CC 

CFLAGS 


CJ CH C** 


instead, then the replacement string for CFLAGS would be empty. 


CCTI C 
tr JL Lt 


$(BASE)/bin/compare_sorts 


The macro EFILE specifies where we want to put our executable file. Macro evaluation, 
or invocation, occurs with a construct of the form %Qmacro_name'). Tins produces the 
string value (possibly empty) of macro.name. The string value of the macro is also 
called its replacement string. Thus 


cril C 
trJ.Lt: 


$(BASE)/bin/compare„sorts 




Dissection of the Makefile for the compare_sorts Program 

■ # Makefile to compare sorting routines. 

Comments can be put in a makefile. A comment begins with a # and extends to the end 
of the line. 



is equivalent to 

EFILE = /home/blufox/base/bin/compare_sorts 
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ft 












INCLS 

LIBS 


LOC 


-I$(LOC)/include 
$(LOC)/Iib/g_lib.a 
$(LOC)/lib/u_lib. a 
/usr/local 




\ 


P^gimilarly, $(0BJS) is replaced by the list of object files, and so forth. Thus, the depen- 
R!5 Sd en cy line states that the executable file depends on the object files. If one or more of 


m 




he first macro specifies the -I option followed directly by the name of a directory rtv 
con ains include files. The second macro specifies two libraries. The third macro splf 
lies another directory. Note that a backslash is used to continue to the next line.il 
irst library, gjib.a, is the graceful library, which we discussed in Section 11.15, “LibJlr« 
ies on page 526. The second library, ujib.a, is the utility library, which we discus si 

f ectlon 1Llb - How to Time C Code,” on page 530. Because our program invofiP 
unctions from these libraries, the libraries must be made available to the compiler wp 
keep the associated header files g.lib.h and u.lib.h in the directory $(L0C)/lncluW 

Wl1 ! haV ® !° bc told to look in this directory for header files. Note thal 
macro can be evaluated before it has been defined. 


1 t be object files has been updated, then the specified actions occur. The second action 
ff 'lihc is expanded to 




@gcc -0 -Wall -o /home/blufox/base/bin/compare_sorts main.o \ 
another_qsort. o chk_order.o compare.o quicksorts \ 

/hotne/blufox/base/1ib/g_lib.a /home/blufox/base/1ib/u_lib.a 




■Jjpthough we have written it on three lines because of space limitations on the printed 
jfcpage, this is actually generated as a single line. Suggestion: If make is new to you, build 
jjjpyour makefiles without the @ symbol at first. Later, after you understand its effects, you 
Blcan use the @ symbol to prevent echoing. 








SSHSl 


0B3S = 


main. o 
compare.o 


another_qsort.o 
quicksort.o 


chk_order.o \ 


>1 




$(0BJS): compare_sorts.h 

$(CC) SCCFLAGS) $(INCLS) -c 


t it r 

4* * C, 


► »v'0<s 


Hi 


| The first line is a dependency line; it says that all the object files depend on the header 
file compare_sorts.h. If the header file has been updated, then all the object files have to 
be updated, too. This is done through the action line. In UNIX, action lines must begin 
Cwith a tab. In MS-DOS, they can start with a tab or with one or more blanks. The con¬ 
struct $* that occurs in the action line is a predefined macro called the base filename 
jj macro. It expands to the filename being built, excluding any extension. For example, if 
jl main.o is being built, then $*. c expands to mai n. c, and the action line becomes 


The macro OBJ Sis defined to be the list of object files that occurs on the right sideSa 

mainTnr W6 USed a backslash t0 continue the line. Although we ptf| 

mam.o first and then list the other .o files alphabetically, order is unimportant 


SCEFILE): $(0BJS) 

©echo "linking 
@$(CC) $(CFLAGS) 


-V.-V;::T r : ?rno 


H 


o SCEFILE) S(OBJS) $(LIBS) 


issilS 






taken Note car f p ™ denc y lme - The second and third lines specify the actions to b<if 

eight blank ! ly ? aCtl ° n lmeS begin with a single tab character. (It looks lileT ' 

fo § he l° n th<? SCreen,) The ® s V mb °l raeans that the action line itself is noil 

to be echoed on the screen (See pvprricp oq ™ rn ) » M®p 

has the form ee exercise 28, 01 pagL 552 *) Because macro invocation 


gcc -0 -Wall -I/usr/loeal/include -c main.c 




Certain dependencies are built into the make utility. For example, each .o file depends 
on the corresponding .c file. This means that if a .c file is changed, then it will be recom- 


jfj! piled to produce a new .0 file, and this in turn will cause all the object files to be 


m 


relinked. 


$( macro_r\ame ) 

the construct S(EFILE) is replaced by 

$(BASE)/bin/compare_sorts 
which in turn is replaced by 


- 




I/usr/local/include 






| An option of the form -Idir means “look in the directory dir for #i ncl ude files.” This 
U option complements our use of libraries. At the top of the .c files making up this pro- 


mm 


m gram, we have the line 


■ :f\ 

W'M&M 


/home/bl ufox/base/bi n/compare_sorts 


#include M compare„sorts.h" 

SI and at the top of compare_sorts.h we have the lines 

|§§ft 


wn 
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#include ”g_lib,h" 
include "u_lib.h 


n 


- : 

These header files contain the function prototypes for the functions in our ^ 

The -I option tells the compiler where to find these header files “"*** 



m 



The make utility can be used to maintain programs in any language not inct r -> 

C ++ . More generally, make can be used in any kind of projecttSSste X, 
dependencies and associated actions. consists of files with $gj 




11.18 The Use of touch 


The touch utility is always available in UNIX and is often available in MS-DOS It mn« '0$ 
new time on a file Thp mnho n fiUh ? a *a i • i . ni uuo. it puts ay3| 

times Md tnLh? U 5 hty deodes whlch actions t0 take by comparing flifeT 
tunes and touch can be used to direct what make does. f g 

previous ^section h along ^ US aSSUme We have the makefile discussed in the SS 

the me ° flleS ' T ° ^ date -W 



touch compare_sorts.h 


•‘jj > < ' 

■"■'.-Ak 


Thiscauses the file to have a more recent time than ail the object files that depend on 
It. Now, if we give thp mmmanrl J c U1<U U <-P™U on 


it. Now, if we give the command 
make 


all the .c files will be recompiled and linked to create a new executable file. 



: 




■ • 


? i 


TJkJi k @ 


'"a 


•■"v „ < %< 


' ".‘ V 


11.19 Other Useful Tools 

Operating systems provide many useful tools for the programmer. Here, we will list a 
few of the tools found on UNIX systems, along with some remarks. Comparable utilities 
are sometimes available in MS-DOS. 



Command 


Remarks 


cb 

dbx 

diff 

gdb 

grep 

indent 

wc 


The C beautifier; it can be used to “pretty print” C code. 

A source-level debugger; code must be compiled with the -g option. 
Prints the lines that differ in two files. 

The GNU debugger; code must be compiled with the -g option. 

Searches for a pattern in one or more files; a major tool for programmers. 
A C code “pretty printer” with lots of options. 

Counts lines, words, and characters in one or more files. 


The cb utility reads from stdi n and writes to stdout. It is not very powerful. To see 
what it can do, try the command 

cb < pgm.c 

where pgm.c is poorly formatted. The utility indent is more powerful, but unlike cb, it is 
not universally available. To make serious use of indent, you will need to read the 
online manual. 

A debugger allows the programmer to step through the code a line at a time and to 
see the values of variables and expressions at each step. This can be extremely helpful 
in discovering why a program is not acting as the programmer expected. The program¬ 
ming world is full of debuggers, and dbx is not a particularly good one. It just happens 
to be one that is generally available on UNIX systems. In the MS-DOS world, debuggers 
are an add-on product. Borland, Microsoft, and others provide excellent products. 

Tools such as diff, grep, and wc are of a general nature. They get used by everyone, 
not just programmers. Although these are UNIX tools, they are often available in MS- 
DOS as well, especially grep, which is very useful to programmers. 

Finally, let us mention that C can be used in conjunction with other high-level tools, 
some of which are languages in their own right. 
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y A 





Utility 


csh 


flex 


lex 


yacc 


Remarks 


A pattern scanning and processing language 


GNU’s version of yacc 


The C shell, which is programmable 


GNU’s version of lex 


Generates C code for lexical analysis 


A newer, more powerful, version of awk 


Practical extraction and report language 


A stream editor that takes its commands from a file 


“Yet another compiler-compiler,” used to generate C code 


:$gm 

itefe- mm 


K 
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Of particular importance to programmers are lex and yacc, or the corresponding GNU 
utilities, flex and bison; see The. UNIX Programming Environment by Brian Kernighan 
and Rob Pike (Englewood Cliffs, N.J.: Prentice-Hall, 1984). The Free Software Foundation 
produces GNU tools. These tools run on many platforms, and they can be obtained via 
the Internet. (The Internet provides access to other tools from other places as well.) If 
you have an Internet connection, the command 

ftp prep.ai.mit.edu 

will connect you to a machine from which you can download GNU tools. (First-time ftp 
users will need assistance.) 
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Summary 

1 The functions printfO and scanf(), and the related file and string versions of 
these functions, all use conversion specifications in a control string to deal with a 
list of arguments of variable length. 

2 The standard header file stdio.h is included if files are used. It contains the defini¬ 
tions of the identifier FILE (a structure) and the file pointers stdi n, stdout, and 
stderr. It also contains prototypes of many file handling functions and definitions 
for the macros getcO and putc(). The function call getc(ifp) reads the next 
character from the file pointed to by i f p. 

3 To open and close files, we use f open() and f cl ose (), respectively. After a file has 
been opened, the file pointer is used to refer to the file. 

4 A file can be thought of as a stream of characters. The stream can be accessed 
either sequentially or randomly. When a character is read from a file, the operating 
system increments the file position indicator by 1. 

5 The system opens the three standard files stdi n, stdout, and stderr at the begin¬ 
ning of each program. The function printfO writes to stdout. The function 
scanf O reads from stdin. The files stdout and stderr are usually connected to 
the screen. The file stdi n is usually connected to the keyboard. Redirection causes 
the operating system to make other connections. 

6 Files are a scarce resource. The maximum number of files that can be open simulta¬ 
neously is given by the symbolic constant F0PEN_MAX in stdio.h. This number is sys¬ 
tem-dependent; typically, it is in the range from 20 to 100. It is the programmer’s 
responsibility to keep track of which files are open. On program exit, any open files 
are closed by the system automatically. 

7 A set of functions that use file descriptors is available in most systems, even 
though these functions are not part of ANSI C. They require user-defined buffers. 
The file descriptors of stdi n, stdout, and stderr are 0,1, and 2, respectively. 

8 An operating system command can be executed from within a program by invoking 
system (). In MS-DOS, the statement 


a 




systemC’dir"); 
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will cause a list of directories and files to be listed on the screen. 

9 Sm N “n^,he ™Xan n d° ** '° the operaSng sys 

wc *. c 


Itfi rrLTr t] !fv SCreen thC WOrd COunt of a]1 lhe - c in the current directory. To 
write ° S stieam of characters from within a program, the programmer can 



wm 


FILE * i fp ; 

ifp = popen("wc *.c", "r"); 

Streams that are opened with popenC) should be closed with pcloseQ. 

10 fhe n utilhv?« tlri ?i Tl emS P u 0Vlde a Utlllty t0 create and manage libraries. In UNIX, 

DOS wnriH tr, 3 6 t G aicddver ’ and it is invoked with the ar command, hi the MS- 
DOS world, this utility is called the librarian, and it is an add-on feature. 




11 ^cesTt^imf ^ Can !f USed t0 keCp track of source files and to Provide convenient 
access to libraries and associated header files. 
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Experiment with the two versions of the program so that you understand the differ¬ 
ent effects. 

2 Rewrite the dbLspace program in Section 11.5, “An Example: Double Spacing a File," 
on page 507, so that it uses a command line option of the form -n, where n can be 
1, 2, or 3. If n is 1, then the output should be single spaced. That is, two or more 
contiguous newline characters in the input file should be written as a single newline 
character in the output file. If n is 2, then the output file should be strictly double 
spaced. That is, one or more contiguous newline characters in the input file should 
be rewritten as a pair of newline characters in the output file. If n is 3, the output 
file should be strictly triple spaced. 

3 Write getstringO and putstringQ functions. The first function should use a 
file pointer, say i fp, and the macro getcQ to read a string from the file pointed to 
by i f p. The second function should use a file pointer, say of p, and the macro 
putcQ to write a string to the file pointed to by ofp. Write a program to test your 

** »*■» IMi A ^ ^ ^ ^ 

4 Write a program to number the lines in a file. The input file name should be passed 
to the program as a command line argument. The program should write to stdout. 
Each line in the input file should be written to the output file with the line number 
and a space prepended. 

5 Read about the ungetcQ function in Section A.l 2, “Input/Output: <stdio.h>,” on 
page 660. After three characters have been read from a file, can ungetcQ be used 
to push three characters back onto the file? Write a program to test this. 

6 Write a program that displays a file on the screen 20 lines at a time. The input file 
should be given as a command line argument. The program should display the next 
20 lines after a carriage return has been typed. (This is an elementary version of the 
more utility in UNIX.) 

7 Modify the program you wrote in the previous exercise to display one or more files 
given as command line arguments. Also, allow a command line option of the form 
-n to be used, where n is a positive integer specifying the number of lines that are 
to be displayed at one time. In MS-DOS, the command to clear the screen is ds\ in 
UNIX it is clear. Try either one or the other of these commands on your system so 
that you understand its effects. Use either system("cl s") or system("cl ear") in 
your program just before you write each set of lines to the screen. 
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8 Ihe library function fgetsO can be used to read from a file a line at a time. Read 
about fgets() in Section A.12, “Input/Output: <stdio.h>,” on page 659. Write a 
program called search that searches for patterns. If the command 

search hello my_file 

is given, then the string pattern hello is searched for in the file my^ file. Any line that 
contains the pattern is printed. (This program is an elementary version of grep.) 
Hint: Use the following code: 

char line[MAXLINE], *pattern; 

FILE *ifp; 
if (argc != 3) { 


if CCifp = fopen(argv[2], "r")) == NULL) { 

fprintf(stderr, "\nCannot open %s\n\n", argv[2]); 
exit(l); 

} 

pattern = argv[l]; 

while (fgets(line, MAXLINE, ifp) != NULL) { 
if (strstrdine, pattern) != NULL) 


9 Modify the function you wrote in the previous exercise. If the command line option 
-n is present, then the line number should be printed as well. 

10 Compile the following program and put the executable code into a file, say try_me: 
#include <stdio.h> 

int main(void) 

{ 

fprintf(stdout, "She sells sea shells\n"); 
fprintf(stderr, "by the seashore.\n"); 
return 0; 


Execute the program so that you understand its effects. What happens when you 
redirect the output? Try the command 


try_me > tmp 
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Make sure you read the file tmp after you do this. In UNIX, you should also try the 
command 

try_me > & tmp 

This causes the output that is written to stderr to be redirected, too. Make sure 
that you look at what is in tmp. You may be surprised! 

11 Write a program called wrt_rand that creates a file of randomly distributed num¬ 
bers. The filename is to be entered interactively. Your program should use three 
functions. Here is the first function: 

void get_info(char *fname, int *n_ptr) 

{ 

printf("\n%s\n\n%s", 

"This program c a file of random numb 0 V S » j 

"How many random numbers would you like? "); 
scanf("%d", n_ptr); 

printf("\nln what file would you like them? "); 
scanf("%s", fname); 

} 

After this function has been invoked in mai n(), you could write 
ofp = fopen(fname, "w"); 

However, the named file may already exist; if it does, overwriting it will destroy 
whatever is in the file currently. In this exercise, we want to write cautious code. If 
the file already exists, report this fact to the user and ask permission to overwrite 
the file. Use for the second function in your program the following “careful” version 
of fopen(): 
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FILE «cfopen(char *fname, char *mode) 

char reply[2]; 

FILE *fp; 

lf "*"} “ 0 44 access Cfrtame, F.OK) 

1 S^Ty)? ■ erwrite 1t? 

if (*reply |= 'y 1 §§ *reply 1 = ' y 1 ) / 
printf("\nBye!\n\n"); 
exit(l); 

} 

} 

fp = gfopen(fname, mode); 
return fp; 


== 0 ) { 


SncUon i s U gfonenn 0 the Sec,lon A ' 16 ' “Miscellaneous," on page 680.) The thin 
Hon 11.6, "Using *? PKSented ” S « 

write your randomly distributed numbers neatly, use'thefollowingmode'' *"* T ' 

for (i = l; -j <= n . ++ -j) { 

fpnntfCofp, "%i2d", randQ); 
if O % 6 == 0 || i == n ) 
fprintf(ofp, "\n") : 

} 

12 McaSepfmclTof wh “ “t Wh “ * ls the file position 

nism for i sniL w y0U are 1,1 ,he fUc ' Thcre ls ”» comparable mecha- 

What gets printed in "n; i a a P nd™ a p“ thM •* «*** 

char c, s[] = ”abc", .p , s; 

FILE *ofpl, *ofp2; 

ofpi = fopenCtmpl”, " w ") ■ 
ofp2 = fopen("tmp2", " w ") ; 

for ci = 0; i < ; +i ) ’ 

sscanf(s, "% c ", & c )• 

, fprintfCofpl, "% c ".’cV 
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for (i = 0 ; i < 3 ; ++i) { 
sscanf(p++, "%c", &c); 
fprintf(ofp 2 , "%c", c); 

} 

13 In this exercise, we examine a typical use of sscanf (). Suppose we are writing a 
serious interactive program that asks the user to input a positive integer. To guard 
against errors, we can pick up as a string the line typed by the user. The following is 
one way to process the string; 

char 1ine[MAXLINE]; 
int error, n; 

do { 

printf("Input a positive integer: ") ; 
fgetsCline, MAXLINE, stdin); 
error = sscanf(line, "%d", &n) != 1 || n <= 0 ; 
if (error) 

printf("\nERR0R: Do it again.\n"); 

} while (error); 

This will catch some typing errors, but not all. If, for example, 23e is typed instead 
of 233, the error will not be caught. Modify the code so that if anything other than a 
digit string surrounded by optional white space is typed, the input is considered to 
be in error. Use these ideas to rewrite the wrt_rand program that you wrote in exer¬ 
cise 11, on page 545. 

} 

14 The two conversion characters x and X can be used to print an expression as a hexa¬ 
decimal number. Are the two conversion characters equivalent? Hint: Try 

printf("11259375 = %#lx\n", 11259375); 
printf("11259375 = %#!X\n", 11259375); 

(In case you are wondering, the number 11259375 was carefully chosen.) 

1 5 Can your compiler handle the conversion character n correctly? (A few years ago, all 
the ones we tried could not, but now they can.) Try the following; 

int a, b, c; 

printf("a%nb%nc%n %d %d %d\n", &a, &b, &c, a, b, c); 
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16 Can we give flag characters in a conversion specification in any order? The ANSI 
document is not too specific about this point, but it seems that the intent is for 
order to be acceptable. See what happens with your compiler when you try the 
lowing code: 

printf("%0+17d\n", 1); 
printf("%+017d\n", 1); 

17 Will the following code get a hexadecimal number from a string? What happens 
0x is deleted in the string? 

char s[] = "0xabc"; 
int n; 

sscanf(s, "%x", &n); 
printf("Value of n: %d\n", n); 

18 Did you read the table of conversion characters for scanf () carefully? Does the fol¬ 
lowing code make sense? Explain. 

char s[] = "-1"; 

unsigned n; 

sscanf(s, "%u", &n); 

printf("Value of n: %u\n", n); 

19 Investigate how tmpnamQ makes its names. Try executing, for example, the follow¬ 
ing program: 

#include <stdio.h> 

int main(void) 

{ 

char tfn[100]; /* tfn = tmp filename */ 

tmpnam(tfn); 

printf("1: tfn = %s\n", tfn); 
tmpnam(tfn); 

pri ntf ("2: tfn = %s\n", tfn); 
tmpnam(tfn); 

printf("3: tfn = %s\n", tfn); 
return 0; 
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Execute the program repeatedly so that you understand its effects. Notice that tfn 
changes one way within the program, and it changes another way with each execu¬ 
tion of the program. On our system, the following line occurs in stdio.h: 

#define TMP_MAX 17576 /* 26 * 26 * 26 */ 

In ANSI C, repeated calls to tmpnam() are supposed to generate at least TMP_MAX 
unique names. On our system, exactly TMP_MAX unique names are generated. What 
happens on your system? 

20 Is the Borland C/C++ compiler available to you? If so, try the command 


When no files are given with the command, then a list of all the options is printed 
on the screen. This is a very nice feature. Try it. 

21 Our program that double-spaces a file can be invoked with the command 

dbLspace infile outfile 

If outfile exists, then it will be overwritten. This is potentially dangerous. Rewrite 
the program so it writes to stdout instead. Then the program can be invoked with 
the command 

dbLspace infile > outfile 

This program design is much safer. Of all the system commands, only a few are 
designed to overwrite a file. After all, nobody likes to lose a file by accident. 

22 In the early days of MS-DOS, a control-z character within the file was used as an 

end-of-file mark. Although this is not done now, if a file has a control-z in it, and it 
is opened as a text file for reading, characters beyond the control-z may be inacces¬ 
sible. Write a program with the following lines in it: 

char cntrl_z = ’\032'; /* octal escape for control-z */ 

int c; 

FILE *ifp, *ofp; 
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ofp = fopen("tmp", "w"); 
fprintf(ofp, "%s%c%s\n", 

"A is for apple", cntrl_z, " and alphabet pie."); 
fclose(ofp); 

ifP = fopen("tmp", "r"); /* open as a text file */ 

while ((c = getc(ifp)) != EOF) /* print the file */ 

putchar(c); 
fclose(ifp); 

printf("\n \n"); /* serves as a marker */ 

ifp - fopenf tmp", "rb"); /* open as a binary file */ 

while CCc = getc(ifp)) != EOF) /* print the file */ 

putchar(c); 

What gets printed? (Does the program act differently on a UNIX system?) In MS- 
DOS, try the command 

type tmp 

Only the characters before the control-z are printed. How do you know that there 

are more characters in the file? Hint: Try the dir command. Normally, control-z 

characters are not found in text files, but they certainly can occur in binary files. 

Subtle problems can occur if you open a binary file for processing with mode "r" 
instead of "rb". 

23 If UNIX is available to you, experiment to see what the following program does: 

^include <stdio.h> 

#include <stdlib.h> 

#define MAXSTRING 100 

int main(int argc, char **argv) 

char command[MAXSTRING]; 

sprintf(command, "sort -r %s" , argv[l]); 
system(command); 
return 0; 

} 

Actually, the program needs improvement. Rewrite it so that it prints a prompt to 
the user. Give the command 

man sort 


to read about the sort utility. 
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24 If you are a C programmer, should you care about assembler code? Surprisingly, the 
answer is yes. The -S option causes your compiler to produce a .s file, and that file 
can be useful to you, even if you cannot read a line of assembler code. Write a sim¬ 
ple program that contains the following lines: 

int i = 10; 

while (— i != 0) /* inefficient? */ 

printfC'i = %d\n", i); 

The value of the expression —i ! = 0 controls the execution of the whi 1 e loop. Is 
the control mechanism inefficient? Write another program that contains the lines 

int i = 10; 

while (—i) /* better? */ 

printfC'i = %d\n", i); 

Compile both of your programs with the -S option. Then look at the difference 
between the two .s files. In UNIX, this can be done with the command 

diff pgml.s pgm2.s 

Is your second program more efficient than your first? 

25 Is Turbo C available to you? If so, make the following modifications to the 
change_case program we presented in Section 11.8, “File Descriptor Input/Output,” 
on page 515. Include the standard header files io.h and sys\stat.h. Replace the octal 
constant 0600 by S_IREAD | S_IWRITE. (These symbolic constants are discussed 
in the Turbo C manual.) With these changes the program should compile and exe¬ 
cute. Does it? 

26 If UNIX is available to you, give the command 

Is -/ 

This provides a long listing of all the files and subdirectories in the current direc¬ 
tory. Note that file modes are displayed. Read about the chmod utility in the online 
manual. To further your understanding about file modes, try commands similar to 
the following lines which have octal numbers. Are the leading zeros necessary? 

date > tmp; is -I tmp 
chmod a+rwx tmp; Is -I tmp 
chmod 0660 tmp; Is -I tmp 
chmod 0707 tmp; Is -I tmp 
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In file makefile 


# Experiment with the make command, 
go: hello date list 
©echo Goodbye! 

hello: 

©echo Hello! 

date: 

©date; date; date 

I s 

i st: 

©pwd; 1s 


27 Look carefully at the execution profile we presented in Section 11.14, “Using the 
Profiler,” on page 524. You can see that rand() was called 50,000 times. This is cor 
rect because the size of the array is 50,000. Note that the number of function calls 
to find_.pivot() equals the number of calls to quicksortQ. By looking at the 
code, we easily convince ourselves that this, too, is correct. But what about the rela¬ 
tionship between the number of calls to parti ti on () and the number of calls to 
qui cksortQ? Can you give a precise explanation? 


28 The last makefile that we presented in this chapter is a real one. Even though we 
dissected it, anyone who has not had experience with make will find the concepts 
difficult to grasp. If tins utility is new to you, try the make command after you have 
created the following file: 


What happens if you remove the © characters? What happens if a file named hello or 
date exists? 


29 Create the following file, and then give the command make. What gets printed? 
Write your answer first; then experiment to check it. 


In file makefile 


# Experiment with the make command! 
Start: A1 A2 

©echo Start 

Al: A3 

©echo Al 

A2: A3 

©echo A2 
A3: 

©echo A3 
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30 What gets printed? 

#include <stdio.h> 

int main(void) 

{ 

printf("Hello!\n"); 
fclose(stdout); 
printf("Goodbye!\n"); 
return 0; 






Chapter 12 

Advanced Applications 




C has the ability to be close to the machine. It was originally used to implement UNIX, 
and building systems is still one of its important uses. In this chapter, we describe 
some advanced applications, including material useful for systems programmers. We 
also discuss the use of matrices for engineers and scientists. 


12.1 Creating a Concurrent Process with fork() 

UNIX is a multiuser, multiprocessing operating system. Each process has a unique pro¬ 
cess identification number. The following command will show you what your machine 
is currently doing: 

ps -aux 

Here is an example of its output: 


0:00 ps -aux 
0:08 vi find.c 


USER 

PID 

%CPU 

%MEM 

sz 

RSS 

TT 

STAT 

START 

blufox 

17725 

34.0 

1.6 

146 

105 

12 

R 

15:13 

amber 

17662 

1.4 

7.0 

636 

469 

3 5 

S 

15:10 

root 

143 i 

0.5 0 

.1 5 

3 

? 

s 

3ul 

31 10: 


The first line contains headings. The remaining lines supply information about each 
process: the user's login name, the process identification number, and so forth. (Read 
the online manual to find out more about the ps command.) The operating system runs 
many processes concurrently by time-sharing the machine resources. 
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HI m 


Ifete 


In UNIX, the programmer can use fork() to create a new process, called the child 
process , that runs concurrently with the parent process. (It is not part of ANSI C.) The 
function forl<() takes no arguments and returns an i nt. The following is a simple pro¬ 
gram illustrating its use: 


m 




a 


mm 

jjglfe 


finclude <stdio.h> 


int main(void) 

{ 

int fork(void), value 


value = fork(); 
printf("In main: value 
return 0; 


%d\n", value); 


/* new process */ 


Illlllilll 

■ IMfi 

Slip 
llillils 

ilr 



The output of this program changes each time we run it. Here is example output: 


In main: value = 17219 
In main: value = 0 


Hj ljljp 

WSIm 

mm 


As noted, when forl<() is invoked, it creates a new process, the child process. This new 
process is an exact copy of the calling process, except that it has its own process identi¬ 
fication number. The function call fork() returns 0 to the child, and it returns the 
child’s process ID to the parent. In the output of our program, the first line was printed 
by the parent, the second line by the child. 

Let us modify this program by adding to the code a second copy of the statement 



value = fork(); 


Here is what gets printed: 


In main: value 
In main: value 
In main: value 
In main: value 


17394 

0 


17395 


The children have unique process identification numbers. The program created four 
concurrent versions of mai n(). The order of execution of these processes is system- 
dependent and nondeterministic —that is, the order is not necessarily the same for each 
execution of the program. (See exercise 1, on page 586.) Caution: Invoking fork() too 
many times can cause the system to fail by exhausting all available processes. 

When forl<() is invoked, it creates two processes, each with its own set of variables. 
If a file pointer is used, however, care must be taken because the file pointer in each of 
the processes will refer to the same underlying file. 
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The value returned by forkO can be used in an i f-el se statement to ^tauna.e 
Uratween the actions of the child and the parent. In our next program, we comput 
Fibonacci numbers in the child process and print elapsed time in the parent process, 
si eepO to suspend execution of the parent process for 2-second tntervals. 

/* Compute Fibonacci numbers and print time asynchronously. */ 

#include <stdio.h> 

#include <time.h> 

int fib(int); 

int fork(void); 

void sleep(unsigned); 

int main (void) 

int begin = time(NULL), i; 

/* rhild */ 

if (fork() ==* 0) 

f or (i =0; i <30; ++i) 

printf("fib(%2d) = %d\n", i, fib(i)); ^ parent */ 

elS for (i = 0; i < 30; ++i) { 

prin?f ("elapsed time = %d\n", time(NULL) - begin); 

} 

return 0; 

} 

int fib(int n) 

if (n <= 1) 
return n; 

return (fib(n - 1) + fib(n - 2)); 

> 

When this program is executed, the outputs of the two processes are intermixed m a 
nondeterministic fashion. 
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12.2 Overlaying a Process: the exec... () Family 

From within a program, the current process, meaning the program itself, can be over 
laid with another process. To do this, the programmer calls a member of the 
exec. . . () family. (See Section A.16, “Miscellaneous,” on page 682, for a list of all 
members.) These functions are not part of ANSI C, but, typically, they are available in 
both MS-DOS and UNIX. 

We want to use the execl () function to illustrate how one process can be overlaid 
with another. The function prototype is 

int execl(char *path, char *arg0, ...); 

In MS-DOS, this prototype is provided typically in the header file process.h. In UNIX, it 
can be in this header file or in some other. The first argument is the path of the execut¬ 
able file—that is, the new process. The remaining arguments correspond to the com¬ 
mand line arguments expected by the new process. The argument list ends with the 
null pointer 0. The value -1 is returned if the executable file cannot be found or is non¬ 
executable. 

Before we use execl () in a program, let us write two other small programs. We will 
use the compiled code to overlay another process. 

In file pgml.c 

#include <stdio.h> 

int main(int argc, char **argv) 

{ 

i nt i; 


printf("%s: ", argv[0]); 
for (i =1; i < argc; ++i) 
printf("%s ", argv[i]); 
putchar(‘\n'); 
return 0; 


/* print the arg list */ 
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In file pgm2.c 

#inc1ude <stdio.h> 

int mainfint argc, char **argv) 

{ 

int i, sum = 0, value; 

-f or fi =0; i < argc; ++i) /* sum The arguments ,c / 

if (sscanf (argv[i], "%d", &value) == 1) 

p r i nt f( H %s: sum of command line args = %d\n , argv[0], sum), 
return 0; 

} 

Next, we compile the two programs: 

cc “O pgml pgmhci cc -o pgm2 pgm2.c 

Observe that pgml and pgm2 are executable files in the current directory. In our next 
program, we will overlay the parent process with one of these mm processes. 

#include <stdio.h> 

#include <process.h> 

int main (void) 

{ 

int choice = 0; 

pri ntf ("%s\n%s\n%s M , - . . „ 

'The parent process will be overlaid. , 

"You have a choice.”, 

"Input 1 or 2; "); 

scanf C n %d” , &choice) ; 

putcharC'Xn 1 ); 

lf execl ("pgml""! "pgml", "a", "b", "c", 0); 

1f .£!?l?pS£ "PjPj-.-r.Ti or" 2 ’• 0) 5 

printf ("ERROR: You did not input 1 or 2.\n ), 

return 0; 

} 
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II we run this program under MS-DOS and enter 1 when prompted, here is what appears 
on the screen: 

The parent process will be overlaid. 

You have a choice. 

Input 1 or 2: 1 

C:\CENTER\PGM1.EXE: a b c 

When a process is successfully overlaid, there is no return to the parent. The new pro¬ 
cess takes over completely. 

In UNIX, forkQ is often used when overlaying one process with another, 
if (fork() == 0) 

execl ("/c/c/bf/bin/mmf" , "rnmf" , "-f”, 0); 

/* execute rnmf */ 

else 

. /* do something else */ 

Using the spawn... () Family 

Most C systems in MS-DOS provide the spawn... Q family of functions. This family is 
similar to the exec. .. () family, except that the first argument is an integer mode. 



Modes for the spawn...() family Meaning 


u Parent process waits until child process com- 

________pletes execution. 

_^'_Concurrent execution—not yet implemented. 

^ Child process overlays the parent process; 

_____same as equivalent exec... () call. 

MS-DOS is not a multiprocessing operating system. Where a programmer might use 

orkQ and execl () in UNIX, spawnl (0, . ..) could be used instead in MS-DOS. Here 
is a call to spawnl () that invokes chkdsk: 

spawnl(0, "c:\chkdsk", "chkdsk", "c:", ”/f", 0); 

Because the mode is 0, the parent process will wait until the child process c:\chkdsk 
completes execution before continuing with its work. 
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12.3 Interprocess Communication Using pipe() 

In UNIX, the programmer can use pipe() to communicate between concurrent pro¬ 
cesses. The function prototype is given by 

int pipe(int pd[2]); /* pd stands for pipe descriptor. */ 

The function call pipe(pd) creates an input/output mechanism called a pipe. Associ¬ 
ated file descriptors, or pipe descriptors, are assigned to the array elements pd [0] and 
pd [1]. The function call returns 0 if the pipe is created, and returns -1 if there is an 
error. 

After a pipe has been created, the system assumes that two or more cooperating pro¬ 
cesses created by subsequent calls to fork() will use read() and wri te() to pass data 
through the pipe. One descriptor, pd[0], is read from, and the other, pd [1], is written 
to. The pipe capacity is implementation-dependent, but is at least 4,096 bytes. If a write 
fills the pipe, the pipe is blocked until data are read out of it. As with other file descrip¬ 
tors, closeO can be used to explicitly close pd [0] and pd[l]. 

To illustrate the use of pi pe (), we will write a program that computes the sum of the 
elements of an array. We compute the sum of each row concurrently in a child process 
and write the values on a pipe. In the parent process, we read the values from the pipe. 

In file concurrent_sum.c 

/* Use pipes to sum N rows concurrently. */ 

#include <stdio.h> 

#include <stdlib.h> 


#define N 




MM 


warn 


IBS: 


i §§■ 

m ja 

mWm 

%km 


int add_vector(int v[]); 

void error_exit(char *s); 

int fork(void); 

int pipe(int pd[2]); 

int read(int fd, void *buf, unsigned len); 

int writefint fd, void *buf, unsigned len); 
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int main(void) 

{ 

int a[N][N] . {{1 , 1} , {2 , 2 . 2} {3 3 3}} 

t, row_sum, sum = 0 , 

/* ,^-j j 


/* Pipe descriptors */ 

if (pipe(pd) == -i) /* 

error_exit("pipeQ failed"); / create a p,pe •/ 

for O = 0 ; i < N; ++i) 

if (forkO == 0) { /* child *, 

row_sum = add_vector(a[i]); 
if (write(pd[l], &row_sum, sizeof(int)) == -i) 
error_exit("write() failed"); 

} return: /* return from child *, 

for (i = 0; i < n ; ++ -j) { 

if (read(pd[0], &row_sum, sizeof(int)) == -li 
error_exit("read() failed"); 
sum += row_sum: 

} 

printf("Sum of the array = %d\n", sum); 
return 0; ’ J ’ 


/* child process */ 


int add_vector(int v[]) 

{ 

int i, vector_sum = 0 ; 

for (i = 0 ; i < M ; ++ -j) 
vector_sum += v[i]; 

. return vector_sum; 


void error_exit(char *s) 

fprintf(stderr, "\nERROR: %s 
exwt(l); 


bye!\n", s) 
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B 


Dissection of the concurrent_sum Program 

■ if (pipe(pd) == - 1 ) 

error_exit("pipe() failed"); 


/* create a pipe */ 


A pipe is created before other processes are forked. If the call to pi pe() fails, we 
invoke error_exit() to write a message to the user and exit the program. 


■ for (i =0; i < N; ++i) 

if (fork() == 0) { /* child process */ 

row_sum = add_vector(a[i]); 

if (write(pd[l], &row_sum, sizeof(int)) == -1) 
error_exit("write() failed"); 

return; /* return from child */ 

} 

Each time through the loop, we use fork() to create a child process. After row_sum has 
been computed, we write it on the pipe with the function call 

write(pd[l], &row„sum, sizeof(int)) 

If the call to wri te() fails, we invoke error_exi t() to write a message to the user and 
exit the program. Note carefully that we explicitly retu rn from the child after the call 
to write(). If we do not do this, then the children will themselves create children each 
time through the loop. 

■ for (i =0; i < N; ++i) { 

if (read(pd[0], &row_sum, sizeof(int)) == -1) 
error_exit("read() failed"); 
sum += row_sum; 


In the parent process, we invoke read() to read row_sum from the pipe. If the call to 
read() fails, we invoke error_exi t() to write a message to the user and exit. 
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12.4 Signals 



An exceptional condition, or signal, is generated by an abnormal event. For example, 
the user may type a control-c to effect an interrupt, or a program error may cause a bus 
error or a segmentation fault. A floating-point exception occurs when two very large 
floating numbers are multiplied, or when division by zero is attempted. ANSI C pro¬ 
vides the function signal () in the standard library. Its function prototype and some 
macros are in signalh. The material in this section applies to both MS-DOS and UNIX. 

The exceptional conditions that can be handled by the operating system are defined 

* 

as symbolic constants in signalh. Some examples are 




#define 
#define 
#define 
#define 


SIGINT 

SIGILL 

SIGFPE 

SIGSEGV 


/* interrupt */ 

/* illegal instruction */ 

/* floating-point exception */ 
/* segment violation */ 




Although the signals that can be handled are system-dependent, the ones that we have 
listed are common to most C systems. 

If an exceptional condition is raised in a process, then the typical default action of 
the operating system is to terminate the process. The programmer can use signal () to J 
invoke a signal handler that replaces the default system action. The function prototype .. 


IKS 


■plips 

mmm 


igj 


void (*signal(int sig, void (*func)(int)))(int); 

This function takes two arguments, an i nt and a pointer to a function that takes an 
i nt, and returns nothing. The function returns a pointer to a function that takes an i nt 
and returns nothing. 

The function call signal (sig, func) associates the signal sig with the signal han¬ 
dler funcQ. This causes the system to pass sig as an argument to func() and invoke 
it when the signal sig is raised. 

Some special signal handlers are defined as macros in signalh. We will use two of 
them: 


r" 



#define 
#define 


SIG_DFL 

bxvj_ L(jN 


((void (*)(int)) 0) 
((void (*)(int)) 1) 


/* default */ 
/* ignore */ 


The casts cause the constants to have a type that matches the second argument to sig 
nal (). Here is an example of how the second macro gets used: 
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signal(SIGFPE, SIG_IGN); /* ignore floating-point exceptions */ 

This causes floating-point exceptions to be ignored by the system. If at some later point 
we want to resume the default action, we can write 

signal(SIGFPE, SIG_DFL); /* take default action >7 

If a standard signal handler is inappropriate in a given application, the programmer 
can use signal () to catch a signal and handle it as desired. Imagine wanting to use an 
interrupt to get the attention of a program without terminating it. The following pro¬ 
gram illustrates this: 

/* Using a signal handler to catch a control-c. */ 

#include <stdio.h> 

#include <signal.h> 

#include <stdlib.h> 

#define MAXSTRING 100 

void cntrl_c_handler(int sig); 
int fib(int n); 

int main(void) 

{ 

int i; 

signal(SIGINT, cntrl_c_handl er); 
for (i =0; i <46; ++i) 

printf("fib(%2d) =%d\n", i, fib(i)); 

return 0; 

} 

void cntrl_c_handler(int sig) 

{ char answer[MAXSTRING]; 

printf("\n\n%s%d\n\n%s", „ . 

"Interrupt received! Signal = , sig, 

"Do you wish to continue or quit? "); 
scanf("%s", answer); 
if (*answer == 'c') 

signal(SIGINT, cntrl_c_handler); 

else 

exi t(l); 


} 

















566 


Chapter 12 ▼ Advanced Applications 


■ -CjS 
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The function fi b() is not shown. It is the same function that we used in Section 12.1 * 
re a ting a Concurrent Process with f o r k ( ), ” on page 557. 1 


i-gggggi 





issis- 

■ 




Dissection of the fib_signal Program 

■ signal(SIGINT, cntrl_c_handler); 




m 



iaiSed ’ the system catches jt and Passes control to the function" tM 
cntrl_c_handler(). 1 


void cntrl_c_handler(int sig) 

char answer[MAXSTRING]; 
printf("\n\n%s%d\n\n%s", 

"Interrupt received! Signal = ", si a 
"Do you wish to continue or quit? ");’ 


When signal () passes control to this function, a message is printed on the screen. 


if (^answer == 1 c f ) 

signal(SIGINT, cntrl_c_handler); 
else 

exit(l); 


signal(SIGINT, cntrl_c_handler); 
allows us to catch another interrupt. 
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12.5 An Example: The Dining Philosophers 

The dining philosophers problem is a standard model for synchronizing concurrent 
processes that share resources. Five philosophers are sealed around a circular table. 
Each philosopher has one chopstick at his side and a bowl of rice in front of him. Eating 
rice requires two chopsticks. There are only five chopsticks, so at most two philoso¬ 
phers can be eating at any one time. Indeed, if each philosopher picked up one chop- 
stick, none could have two and they would all be deadlocked. A philosopher may 
acquire only the chopsticks to his or her immediate left or right. The problem is to 
write a program with concurrent processes representing the philosophers, where each 
philosopher gets to eat fairly often. 


Fifty years of 
reading books . 
for THIS? 


Touch my 
chopstick and 
you’ll KNOW what 
non-existence is! 


It’s true! A single 
chopstick is without 
MEANING! 


\ ft 


*lui( 




mtt 


r * % 

c 


What! No 
SOY sauce!? 


<%&0 



0 o 




Hey! I don*t \ 0 

even LIKE rice! / o 0 0 


























det = determinant(a); 
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The function definition for determi nant () will look like 

double determinant(double a[][3]) 

{ 

♦ « * * ► 

Because the compiler needs the 3 to build the correct storage mapping function, our 
determinant function can be used only on 3 x 3 matrices. If we want to compute the 
determinant of a 4 x 4 matrix, we need to write a new function definition. This is unac¬ 
ceptable. We want to be able to write a determinant function that can be used foi 
square matrices of any size. 


Building Matrices with Arrays of Pointers 


By starting with the type pointer to pointer to double, we can build a matrix of what¬ 
ever size we want, and we can pass it to functions that are designed to work on matri¬ 
ces of any size. Let us start with the code we need to create the space for a matrix: 


int i, j, n; 

double **a, det, tr; 

. /* get n from somewhere */ 


a = calloc(n, sizeof(double *)); 
for (i =0; i < n; ++i) 

a[i] = calloc(n, sizeof(double)); 


The size of the matrix does not have to be known a priori. We can get n from the user or 
read it from a file or compute it. It does not have to be coded as a constant in the pro¬ 
gram. Once we know the desired size, we use the standard library function cal loc() to 
create the space for the matrix dynamically. The function prototype is given in stdlib.h 

ns 


void *calloc(size_t niterns, size_t size); 
Typically, the type definition for si ze_t 

typedef unsigned size_t 


This type definition can be found in stddef.h and sometimes in stdlib.h , too. If si ze_t is 
not defined directly in stdlib.h , then stddef.h will be included there. A function call of 
the form 
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calloc(n, sizeof( type )) 

allocates space in memory for an array of n elements of the specified type. The base 
address of the array is returned. Thus, the statement 

a = calloc(n, sizeof(double *)); 

allocates space for a, which we can now think of as an array of pointers to double of 
size n. In the for loop, each element of a is assigned the base address of space allo¬ 
cated in memory for an array of doubl es of size n. We can think of a in memory as 

An nxn matrix in memory 





« i « 


0 

1 

2 


n-1 




mmm 


0 

1 

mmm 


n-1 




mmm 


0 

* 

ar 

• 

1 

2 


n-l 




mmm 



012 n-1 



Now that space has been created for the matrix, we want to fill it. Let us assign to the 
elements of a integer values that are randomly distributed in the range from -9 to +9. 

for (i =0; i < n; ++i) 
for (j =0; j < n; ++j) 

a[i][j] = rand() % 19 - 9; /* from -9 to +9 */ 

Note that even though our matrix a is stored in memory as an array of pointers, the 
usual matrix expression a[i] [j] is used to access the element in the ith row, jth col¬ 
umn (counting from zero). Because a is of type doubl e **, it follows that a[i ] is of 
type double *. It can be thought of as the ith row' (counting from zero) of the matrix. 
Because a[i ] is of type double *, it follow's that a[i] [j] is of type double. It is the 
jth element (counting from zero) in the ith row. 
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Now that we have assigned values to the elements of the matrix, we can pass the 
matrix as an argument to various functions. Suppose we want to print it, compute its 
determinant, and compute its trace. In the calling environment we can write 


print_matrix(a, n); 
det = determinantCa, n); 
tr = tracefa, n); 

The function definition for print_matrix() is given by 

void print_matrix(double * * a, int n) 

{ 

int i, j; 

for (i = 0; i < n ; ++i) { 
for (j = 0; j < n; ++j) 

printf("%7.1f", a[i][j]); 
putchar('\n'); 

} 

putchar('\n'); 


The function definition for determinantQ begins as 

double determinant(double **a, int n) 

{ ' 


Because the determinant function is complicated, we postpone further discussion until 

the exercises. The trace, however, is easy. By definition, the trace of a matrix is the sum 
of its diagonal elements: 

double trace(double **a, int n) 

{ 

int i; 

double sum = 0.0; 

for (i =0; i < n; ++i) 
sum += a[i] [i]; 
return sum; 
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Adjusting the Subscript Range 

In mathematics, the subscripts for vectors and matrices usually start at 1, not 0. We can 
arrange for our C code to do this, too. The following function can be used to create 
space for an n-vector: 

double *get_vector_space(int n) 

{ 

int i; 

double *v; 

v = calloc(n, sizeof(double)); 

return (v - 1); /* offset the pointer */ 

} 

Because the pointer value that is returned has been offset to the left, subscripts in the 
calling environment will run from 1 to n rather than from 0 to n - 1. 

An n-vector indexed from 1, not 0 

Shaded area indicates 
v space not owned by the 

\ programmer 


0 12 n 

Note that in the picture, we have shaded an element to indicate that the programmer 
does not own element 0. Now we can write code like the following: 

int n; 

double *v; 

. /* get n from somewhere */ 

v = get„vector„space(n); 
for (i = lj i <= n \ ++i) 

v[i] = randQ %’l9 - 9; /* from -9 to +9 */ 

* » K * N 

Instead of offsetting a pointer, we always have the option of allocating more space 
and then not using it all. 
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v - calloc(n + 1, sizeof(double)); 
for (i = 1; i <= n; ++i) 
v[i] = ... 



/* allocate extra space */ |j 


Wtp- 

WmMKwSmt. wmm 




I he technique of offset ting the pointer is better in the sense that no space is wasted. 
Let us use the technique to write a function that creates space for an m x n matrix with 
subscripts that start at 1 rather than 0, j 

double **get_matrix_space(int m, int n) 

i nt i; 

double **a; 


mmmmi 
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a = callocCm, sizeof(double *)); 

~“ a > , /* offset the pointer */ 

for (i = lj i <:== m; ++i) f 

a[i] = calloc(n, sizeof(double)); 

a Ci]! /* offset the pointer */ 

return a; 


We can think of the matrix a in memory 7 as 


Anmxn matrix indexed from 1, not 0 


lilll#; 
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Note that once again we have used shaded areas to indicate space not owned by the 
programmer. 
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If a programmer wants space previously allocated by callocQ to be made available 
to the system again, then the function f ree() must be used. Let us suppose that in 
mai n () we write 

i nt i , j, n; 

double **a; 

for ( ; ; ) { /* do it forever */ 

. /* get n from somewhere */ 

a = get_matrix_space(n, n) ; 
for (i =1; i <= m; ++i) 
for (j = 1; j <= n; ++j) 

a[i][j] = ... /* assign values */ 

. /* do something */ 

release_matrix_space(a, m); 

} 

Here is the function definition for release_matrix_space(). We have to be careful to 
undo the pointer offsets that occurred in get_matrix_space(), 

void release_matrix_space(double **a, int m) 

{ 

int i; 

for (i =1; i <= m; ++i) 
free(a[i] + 1); 
free(a + 1); 

} 


Allocating All the Memory at Once 

In certain applications, especially if the matrices are large, it may be important to allo¬ 
cate all the matrix space at once. At the same time, we want our matrices to be indexed 
from 1, not 0. Here is code that we can use to allocate space for our matrices: 

In file matrix.h 

#include <stdio.h> 

#include <stdlib.h> 

typedef double ** matrix; 

typedef double * row; 

typedef double elem; 
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matrix 
void 
void 
void 


get_matrix_space(int m, int n); 
release_matrix_space(matrix a); 
fill_matrix(matrix a, int m, int n); 
prn_matrix(const char *s, matrix a, int m, int n); 


Note that we can get different kinds of matrices just by changing the typedef 
priately. 

In file space.c 


#include "matrix.h" 

matrix get_matrix_space(int m, int n) 

int i; 

elem * p; 
matrix a; 

p = malloc(m * n * sizeof(elem)); 
a = malloc(m * sizeof(row)); 

—a; 

for (i = 1 ; i <= ffl; ++ i) 

a[i] = p + ((i - 1) * n) - 1; 
return a; 


/* get space all at once 
/* offset the pointer 


void release_matrix_space(matrix a) 

elem * p; 

p = (elem *) a[l] + l; 
free(p); 
free(a + 1); 


/* base address of the array 


Suppose that in some other function we write 

int m, n; 

matrix a; 


get_matrix_space(m, n); 


/* get m and n from somewhere 
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Here is how we can think of a in memory: 

Space for an mxn matrix allocated 
all at once and indexed from 1 , not 0 



Caution: If we use algorithms that swap rows and we want to deallocate our matrix 
space, then we have to be careful not to lose the base address of the array in memory. 
We may want to redesign our abstract data type (ADT) matrix. 


12.7 Returning the Status 

Throughout this text, at the end of every function definition for mai n() we have writ¬ 
ten the line 

return 0; 

In this section, we want to explain how the value returned from mai n () can be used by 
the operating system. From the viewpoint of the operating system, the value returned 
by a process is called its status. Processes include both programs and shell scripts. The 
operating system does not necessarily have to use the status that gets returned from a 
process. 
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In most operating systems, you communicate with the machine by clicking the mousl! 
or typing commands on the keyboard. When you give commands such as 


mm 


mm 


cc pgm.c date echo Beautiful! 


gg£ M 


mm 


you ait communicating via the shell. The shell (which is itself a program) interpret!® 
what you write and tries to execute your commands. There are three shells common]! 
round on UNIX systems: the Bourne shell sh, the C shell csh, and the Korn shell ksh. 

To show how the value returned from main() (the program status) can be used, W <§1 

are going to write a small program and then write a shell script that invokes our pm. jU 
gram. Here is our program: 




m 


m 


In file try_me.c 

#include <stcHo.h> 
#define MARKER 


,» 5i i 




■: 




-.5 






II 


» 


'3 


int main(int arqc 
int val: 


char **argv) 


WZ 




(( 


Input an integer: "); 


} 


printf("\n"); 
printf(MARKER 
scanf("%d", &val), 

printf(MARKER "Value being returned: %d\n\n", val): 
return val: 


far 


MM 












Note that main() can return any integer value, not just zero. Note also that we are 
using a marker to clearly indicate the lines that are written to the screen. 

Next, we compile this program, putting the executable file in a directory that is in our . 

Pcith. 


m m 


fe- 




gcc -o $base/bin/try_me try_me,c 
Now we are ready to write our shell script. 
In file gentry 


m 


- .. 

■ - - 


m 




. 

•' - • y> v l 




SS 




#!/usr/bin/csh 




: r . 


## 




## Experiment with the status 

ji jf 
JXjX 


mmsm 
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. 

SB!" 


12.7 ▼ Returning the Status 


581 


!Cho- 

echo At the top: status = $status 
t ry_me 
icho — 

echo After try_me: status = Sstatus 
try_me 

set val = Sstatus 
icho- 

echo After try_me again: val = Sval 
icho "" 
icho- 

echo To exit from the while loop, input 0. 
while (Sval) 
t ry_me 

set val = Sstatus 
echo In the loop: val = Sval 
ind 




Dissection of the go_try Shell Script 



m #!/usr/bin/csh 

A comment starts with a # and continues to the end of the line. This line is special, how¬ 
ever, because it begins with #! in column 1 at the top of the file. Lines such as this are 
read by the operating system. This line tells the operating system to use the C shell, 
which is / usr/bin/esh , to execute this file. 

■ ## Experiment with the status. 

Comments begin with a # and continue to the end of the line. 

■ echo - 

echo At the top: status = $status 

The echo command takes the remainder of the line and writes it to the screen. The 
symbol $, however, is special. The value of a shell variable is obtained by writing 


$ (sheU_variable_name ) 


The shell variable status is a built-in variable. 










Ay 


Vv-' • ‘C.- 1 - ;\i&*-/~S~-' : - ~ yfeSIl 

Kp l fe .'/. 

Sfckf* :;£;*. *?. •■: 

lllllfe \-M 

®s$s •• '• -i 
■HU ." ' 
W!ftfei8$& $sd| 
I 

/• v";- 

S&aKsf'ij* 


§P§& 5 ^ 11118 ® 
/& 

:: vS-"liffl| 

gfete :$•'•. aj 

xKr.-fss;- -SiS^sissi 

• m 




h.-,v,:?. 

i 
:* ■-:•:■:• . ' 


« Mi gi 

’is^m 

WlmMm 


H&fP? :••' " 


flroaawwm*/*' 

npg 


SS 5 @^< &-^£* 3 g 

HS8fet : llSlB 

fcf-Ill 

£ 

MfsiK••••'' vj-s 


Q&a&§ 

ISsIs? m 


WgSm- 

1111311 


582 Chapter 12 ▼ Advanced Applications 


Finally, we enter a loop so that we can experiment and see the results printed on the 
screen. 


■ try_me 
echo- 

echo After tryjne: status = Sstatus 


echo To exit from the while loop, input 0. 
while ($val) 
try_me 

set val = Sstatus 

echo In the loop: val = Sval 


At this point, we invoke our program try_me. After we have done this, we want to exam¬ 
ine the program status. (As we will see, our attempt fails.) 


After we invoke try_me again, we use the set command to create a shell variable named 
val that has, as its value, the current value of Sstatus. (As we shall see, val has the | 
value returned by mai n() in the program tryjne.) We use the echo command to display 
Sval on the screen. 


Our shell script go.try is not very informative by itself. When we execute it, however, a% 
lot of new ideas start to fall into place. To make the file executable, we give the com¬ 
mand ■ 

chmod u+x gojry 


Finally, we give the command 


gojry 

and enter some numbers when prompted. Flere is what appears on the screen: 


tryjne 

set val = Sstatus 
echo- 

echo After tryjne again: val 


= Sval 
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At the top: status = 0 

» Input an integer 
» Value being returned 


After tryjne: status = 0 

» Input an integer: 7 
» Value being returned: 7 


After tryjne again: val = 7 




Dissection of the Output From go_tty 


At the top: status = 0 


The lines that begin with » were written by the try_me program. The nonblank lines 
were written by echo commands in the shell script gojry. Every process resets the 
value of status upon exiting, and by convention 0 is used to signify normal termina¬ 
tion of a process. An echo command is itself a process. The very first one we did was 


echo — 

When this process terminated, it set the value of status to 0. This is reflected in the 
output line 

At the top: status = 0 

■ » Input an integer: 3 

» Value being returned: 3 

These are lines generated by the tryjne program. When prompted, we typed a 3 at the 
keyboard. The second line reports this. 
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After try_me: status = 0 

At first, this report surprised us, but then wc remembered that status is ephemeral 

The echo command that generated — reset the status after trv_me. Hence, the value 
that gets reported is 0, not 3. 

■ » Input an integer: 7 

» Value being returned: 7 

After try_me again: val = 7 

After try__me is invoked again, we enter 7 when prompted. Immediately after try_me 

exits, we use the shell variable val to capture the current value of $status. In the shell 
script, the relevant lines are 

t ry_me 

set val = Sstatus 
echo — 

echo After try_me again: val = $val 

The while loop in the shell script go_try allows the user to experiment. We observe 

that when we enter an integer n, the value that gets printed is n mod 256. In particular, 

1 yields 255. Thus, the program status is a nonnegative number between 0 and 255. 

When mai n() returns a value to the shell, the shell can use it for its own purposes if 
this is desirable. 

Although we used the UNIX C shell in our discussion in this section, the story is sim¬ 
ilar with respect to the other UNIX shells and v\1th respect to MS-DOS. 
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Summary 

1 A concurrent process is code that executes simultaneously with the code that 
invoked it. In UNIX, fork() can be used to create a child process that is a copy of 
the parent process, except that the child has its own unique process identification 
number. If a call to forl<() is successful, it returns 0 to the child and the child’s 
process identification number to the parent; if unsuccessful, it returns -1. 

2 In both MS-DOS and UNIX, a member of the exec... () family can be used to over¬ 
lay one process with another. There is no return to the parent. Another family, 
spawn... 0, is available in MS-DOS. With this family, it is possible to return to the 
parent. This use, in a certain sense, is comparable to the combined use of fork() 
and exec. .. () in UNIX. 

3 In UNIX, the system call pi pe(pd) creates a mechanism for interprocess communi¬ 
cations called a pipe. After a pipe is open, calls to fork() are used to create pro¬ 
cesses that communicate via the pipe. The functions readQ and write () are used. 

4 The function si gnal () is available in the standard library. It can be used to associ¬ 
ate a signal with a signal handler. The signal handler can be a function the program¬ 
mer writes to replace the default system action. When the signal is raised, program 
control passes to the signal handler. The set of signals that are handled by the oper¬ 
ating system is defined as macros in signal.h. This set is system-dependent, but 
some of the signals are common to both MS-DOS and UNIX. 

5 The dining philosophers problem is a standard model for synchronizing concurrent 
processes that share resources. The problem is to write a program with concurrent 
processes representing the philosophers, where each philosopher gets to eat (share 
resources) fairly often. 

6 A semaphore is a special variable allowing wait and signal operations. The variable 
is a special location for storing unspecified values. The wait operation expects one 
item and removes it. The signal operation adds one item. The wait operation blocks 
a process until it can accomplish its removal operation. The signal operation can 
start up a blocked process. 

7 By starting with the type pointer to pointer to double, we can build a matrix of 
whatever size we want, and we can pass it to functions that are designed to work on 
matrices of any size. This is an important idea for engineers and scientists. 
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Exercises 

1 Modify the simple forking program given in Section 12.1, “Creating a Concurrent 

rocess with forkO,” on page 556, so that it has three copies of the line 

value = fork(); 

The output of the program is nondeterministic. Explain what this means 
Hint: Execute your program repeatedly. 

2 rnnmin?tn ail f S ’rf 0 CMd Pr ° CeSS iS CTealed and _1 is returned - w rite a program that 
contains the following code: 

#define N 3 

for (i =1; i <= N; ++-j) { 
pid = fork(); 
if Cpid == 0) 

printf("%2d: Hello from child\n", i); 
else if (pid > 0) 

printf("%2d: Hello from parent\n", i); 
else ’ 

printf("%2d: ERROR: Fork did not occur\n", i); 

How large must N be on your system to get an error message printed? 

3 In Section 12.2, “Overlaying a Process: the exec... () Family," on page 560, we pre- 

sented a program that illustrates how one process gets overlaid with another. Mod- 

i y that program. Begin by creating another executable file, say pgm3, which prints 

the current date. Provide that as another choice for overlaying the parent process in 
the program you are modifying. 

4 D m eS th f command work on your system? If so, find out where the execut- 

a le code is. Then take one of your working programs and insert code similar to 

if (fork() == 0) 

execl("/usr/games/fortune", "fortune", 0 ); 

What is the effect of doing this? 

5 Write a program that uses n concurrent processes to multiply two n x n matrices. 
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6 Compile two programs into executables named progl and prog2. Write a program 
that executes both concurrently by forking twice and using execl () to overlay the 
two executables. 

7 When a pipe is filled to capacity on your system, how many characters does it hold? 
Write a program to find out. Hint: Write characters into the pipe until it blocks. 

8 Change the dining philosophers program so that upon receiving an interrupt it 
prints out how many times each philosopher has eaten. Also, experiment to see 
what happens if the pi ck_up() function is changed so that philosopher number 3 
picks up the right chopstick first and everyone else picks up the left chopstick first. 

9 How signals are handled is system-dependent. Try the following program on differ¬ 
ent systems to see what happens: 


#include <stdio.h> 
#include <signal.h> 
#include <math.h> 


/* HUGE_VAL defined here */ 


int main(void) 

{ 

double x = HUGE_VAL, y = HUGE_VAL; 


signal(SIGFPE, SIC_IGN); 
printf("Ignore signal: x * y = 
signal(SIGFPE, SIG_DFL); 
printf("Default signal: x * y 
return 0; 


= %e\n", x * y); 
= %e\n", x * y) 


10 Starting with a variable of type double **, we showed how a matrix can be built 
dynamically and then passed to functions. Do we really have to do it all dynami¬ 
cally? Consider the code 


int 

double 

double 


*a[3] , det; 

trace(double **, int); 


for (i =0; i <3; ++i) 

a[i] = calloc(3, sizeof(double)) 

A » it 9 * 

tr = trace(a, 3); 


/* function prototype */ 


/* fill matrix a [] [] */ 


The array a is being passed as an argument, but from the function prototype we see 
that an argument of type double * * is needed. Will your compiler complain? 
bx plain. 
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11 In the 1970s, before C was available to us, we had occasion to use PL/1, a langu| K e jj« 
that is commonly found on IBM mainframes. The PL/1 manuals always stressed® 5 *® 
that the compiler could create vectors and matrices that started at any desire! 
index. For example, a declaration of the form 


int automobi1es[1989 : 1999]; 


/* a PL/1 style declaration */&%H 


would create an array of size 11, with its index starting at 1989. This array could be 
used to store the number of automobiles sold, or projected to be sold, in the years!/ 
1989 to 1999. Perhaps one reason this concept was stressed was that FORTRAN and - 2 
other languages were not able to create such arrays. In C, of course, such an arrayf* 
can be created. Explain how this is done. 






12 Filling matrices with integers that are randomly distributed in the range from -N Rff 
+N\s convenient for testing purposes. If you are checking machine computations 
by hand, then a reasonable value for N is 2 or 3. Write a function that fills matrices. 1 

■iSMls 

void fi1l_matrix(double **a, int m, int n, int N) 

{ 




• jpg 




13 Let A = (a«) be an n x matrix. The determinant of A can be computed using GaussHp 
ian elimination. This algorithm requires that for k = 1 to n - 1 we do the following:Hat 
Start with and examine all the elements in its column that are below a^, includ¬ 
ing a kk itself. Among those elements find the one that is the largest in absoluteji 
value. This element is called the pivot. If the pivot is in the ith row, and z is no|gf 
equal to k, then interchange the ith and kth rows. (Keep track of the number of l Mi 
interchanges that are performed.) After this has been done, a^ will be the pivot. Jf|H$j 
this value is zero, then the value of the determinant is zero and the algorithm is fin- 5JJP 
ished. Otherwise, for i = k + 1 to n add a multiple of the kth row to the ith row,j 
where the multiple is given by (-1 / a^) x ajk- The final step is to take the product jiij 
of the diagonal elements of the matrix. If the number of row interchanges in the 
algorithm is even, then this product is the determinant. Otherwise, the negative of jj§§ 
this product is the determinant. In this exercise you are to write a program that 
computes the determinant of nxn matrices. In order to have subscripts that start 
at 1 rather than 0, use the function get_matrix_space(). Hint: Use hand simula¬ 
tion of the algorithm on a small matrix first so that you understand the details of 
the algorithm. 
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for (k = 1; k <= n; ++k) { 
find the pivot element 
if the pivot is zero, return zero 
if the pivot is in the ith row, and i ! = k, 

then interchange the i th row with the kth row 
and increment the interchange counter i_cnt 

pivot = a[k] [k]; 

for (i = k + 1; i < = n; ++"0 { 

multiplier = -a[i][k] * (1.0 / pivot), 

for (i = k; j <= n; ++j) 

a[i][j] += multiplier * a[k] [j]; 

} 

} 

if (i_cnt % 2 == 0) 
det = 1.0; 
el se 

det = -1.0; 

for (i =1; i <= n; ++i) 
det *= a[i] [i]; 
return det; 

One way to get a three-dimensional array is to make a declaration such as 
double a[9][2][7]; 

Suppose we want to pass this array to a function, and we want the function to work 
on three-dimensional arrays of various sizes. Because the function definition will 
require the 2 and the 7 to build the correct storage mapping function, lmpiement- 
ing a as an array of arrays will not work. Instead, we can wnite 

int i, j, nl, n2, n3; 

double ***a; 

/* get sizes from somewhere V 

a = calloc(nl, sizeof(int **)); 
for (i =0; i < nl: ++i) f 

a[i] = calloc(n2, sizeof(double ”))> 

for (j = 0; j < n2; ++j) . 

a[i][j] = calloc(n3, sizeof(double)); 

} 

Draw a picture of how a can be thought of in memory. Write a program that imple¬ 
ments this scheme. Your program should get the sizes from the user interactive y, 
allocate space for the three-dimensional array, and fill the array with randomly dis¬ 
tributed integers from a small range. Note that each a [i ] can be thought of as a 
matrix. Print the array on the screen by first printing a [0], then a [1], an so or . 
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Finally, print the sum of all the elements of the array. Use a function to 
this sum. 


15 Consider the following program: 

#include <stdio,h> 

#define N 3 

double trace(double *a[]); 

int main(void) 

{ 

double a[N][N] = fl, 2, 3, 4, 5, 6, 7, 8, 9}; 

printf("The trace of a is %.3f\n", trace(a)); 
return 0; 


double trace(double **a) 

int i, j; 

double sum = 0.0;; 

for (i = 0; i < N; ++i) /* N is hardwired in the body */ 

for (j =0; j < N; ++j) 
sum += a[i] [j]; 
return sum; 


The compiler complains about the call trace (a). If we change this to 
trace((double [N][N]) a) 
will the compiler be happy? Will the code work? Explain. 

16 Remote procedure call (RPC) is used in distributed systems to execute code on 
other machines on a network. First, the program running on the local machine, 
called a client, sends a request to the second machine, called a server. The server 
calls a routine to perform the requested service. Finally, the results of the request 
are returned to the client. The following code is based on Sun Microsystems RPC 
and was written by Darrell Long: 
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//SUN MICROSYSTEMS RPC code. 

// compile with cc pgc.c -Irpcsvc 

* j 

#include <stdio.h> 

#include <rpc/rpc.h> 

#include <rpcsvc/rstat.h> 

fit 

//Poll the host. 

V 

void do_poll(char *host) { 
int stat; 

struct statstime result_stats; 

stat = callrpc(host, RSTATPROG, RSTATVERS_TIME, 

RSTATPROC_STATS, 

xdr_void, 0, xdr_statstime, &result_stats); 
if (stat == RPC_SUCCESS) 

fprintf(stdout, "DATA %s %ld %ld\n", host, 
resul t_stats.boottime.tv_sec, 
result_stats.cu rtime.tv_sec); 

} 

int main(void) 

{ 

do_poll ("machine.school. edu") ; /*p oiled machine */ 

return 0; 

} 

Run this code across your local network. Look in the file 

/usr/include/rpcs vc/rsta t. h 
to figure out what these statistics mean. 
































This chapter gives an overview or the C++ programming language. It also provides an 
introduction to C++’s use as an object-oriented programming language. In the chapter, 
a series of programs is presented, and the elements of each program are carefully 
explained. The programs increase in complexity, and the examples in the later sections 
illustrate some of the concepts of object-oriented programming. 

The examples in this chapter give simple, immediate, hands-on experience with key 
features of the C++ language. The chapter introduces the reader to stream I/O, operator 
and function overloading, reference parameters, classes, constructors, destructors, 
templates, and inheritance. Mastery of the individual topics requires a thorough read¬ 
ing of a companion book such as either Pohl, C++ for C Programmers, 2d ed (Redwood 
City, CA: Benjamin/Cummings, 1993) or Pohl, Object-Oriented Programming Using C++, 
2d ed (Reading, MA: Addison-Wesley, 1997). 

Object-oriented programming is implemented by the class construct. The class 
construct in C++ is an extension of struct in C. The later examples in this chapter 
illustrate how C++ implements OOP (object-oriented programming) concepts, such as 
data hiding, ADTs, inheritance, and type hierarchies. 
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13.1 Output 

Programs must communicate to be useful. Our first example is a nrogram that 
on e screen the phrase "C++ is an improved C,” The complete program is 

In file improved.cpp 

// A ^llustrating output. 

// Author: Richmond Q. Programmer 

#include <iostream.h> 

int mainf) 

{ 

, cout « "C++ is an improved C.\n"; 

The program prints on the screen 
C++ is an improved C. 
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Prints 
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Mil cout « "C++ is an improved C.\n"; 

K 

|J This statement prints to the screen. The identifier cout is the name of the standard 
!§output stream. The operator << passes the string "C++ is an improved C.\n" to 
p. standard out. Used in this way the output operator « is referred to as the put to or 
1 insertion operator. 
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Dissection of the improved Program 

■ //A first C++ program illustrating output. 

Sp 7/ , iS a new comment symbol. The comment runs to the end of the 

“° ,d C brack0,lng «—• «»*«* /* V are s,il, avaUabk for » 

■ #include <iostream.h> 

The lostream.h header introduces I/O facilities for C++. 

■ int mainf) 

innni.fpT 131 !parentheses always mean main(void), never main( ) C++ stvle is 

not to use the redundant voi d for declaring the empty argument list. ' ° 


K 


1111 


We can rewrite our first program as follows: 

// A first C++ program illustrating output. 

#include <iostream.h> 

main() 

{ 

cout « "C++ is an improved C." « end!; 

} 

Although it is different from the first version, it produces the same output. This version 
drops the explicit declaration of main() as returning an i nt and uses the fact that this 
return type is implicit. Here we use the output operator « put to twice. Each time the 
« is used with cout, printing continues from the position where it previously left off. 
In this case, the identifier endl forces a new line followed by a flush. The end! is called 
a manipulator. 


13.2 Input 

We will write a program to convert to kilometers the distance in miles from the Earth to 
the moon. In miles this distance is, on average, 238,857 miles. This number is an inte¬ 
ger. To convert miles to kilometers, we multiply by the conversion factor 1.609, a real 
number. 

Our conversion program will use variables capable of storing integer values and real 
values. In C++, all variables must be declared before their use, but unlike in C, they need 
not be at the head of a block. Declarations may be mixed in with executable statements. 
Their scope is from the point of the declaration to the end of the block within which 
they are declared. Identifiers should be chosen to reflect their use in the program. In 
this way, they serve as documentation, making the program more readable. 
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ip® 


i ? e f' P*°8 ldnis assume a 4-byte int, but on some machines these variables 
e declared 1 ong. You can check the constant INT_MAX in limits.h. 


plj Mg "; ■ 

Sk(-A- @ 


should 


In i'ile moon.cpp 

// The distance to the moon converted to kilometers 
// Title: moon 

#include <iostream.h> 


int main() 

{ 

const int moon = 238857; 

cnul T: " T !V 10 °»' S dis 5 dnce from Earth is " « moon 
cout « miles. « endl; 

int moon_kilo = moon * 1.609; 

rnni T h 1 ! 0 ^ 1omete rs this \s " « moon_ki 10; 

cout << km. « end! ,* 


The output of the program is 

The moon's distance from Earth is 238857 miles 
In kilometers this is 384320 km. 
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Dissection of the moon Program 

■ const int moon = 238857; 

The keyword const is new in C++. It replaces some uses of the preprocessor command 

initialized vah^ 7™^ U§ing tlliS type modifier informs the compiler that the 

initialized value of moon cannot be changed. Thus, it makes moon a symbolic constant. 

" cout <<: " The moon's distance from Earth is " « moon; 

S\dditrn^vl!-mH Can d ; scriminate am °ng a variety of simple values without need- 

ng information. Here the value of moon is printed as an integer. 
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i int moon_kilo = moon * 1.609; 


Declarations can occur after executable statements. This allows declarations of vari¬ 
ables to be nearer to their use. 



Let us write a program that will convert a series of values from miles to kilometers. 
The program will be interactive. The user will type in a value in miles, and the program 
will convert this value to kilometers and print it out. 

In file mi_km.cpp 

// Miles are converted to kilometers. 

// Title: mi_km 

#include <iostream.h> 

const double m_to_k = 1.609; 

inline int convert(int mi) { return (mi * m_to_k); } 


int main() 

{ 

int miles; 
do { 

cout « "Input distance in miles: "; 
cin » miles; 

cout « "\nDistance is " « convert(miles) « " km." 

« endl; 

} while (miles > 0); 

} 

This program uses the input stream variable ci n, which is normally standard input. 
The input operator » is called the get from or extraction operator, which assigns values 
from the input stream to a variable. This program illustrates both input and output. 
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Dissection of the mi_km Program 


m 




const double m_to_k = 1.609; 


m 


C++ reduces C’s traditional reliance on the preprocessor. For example, instead of hav¬ 
ing to use define, special constants, such as the conversion factor 1.609, are simply 
assigned to variables specified as constants. 
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inline int convertfint mi) { return (mi * m_to_k); } 


-- 

- . . 


• •am 


The new keyword i n 1 i ne specifies that a function is to be compiled, if possible as 
inline code. This avoids function call overhead and is better practice than C’s use of 
defi ne macros. As a rule, i nl i ne should be done sparingly and only on short func¬ 
tions. Also note how the parameter mi is declared within the function parentheses. C++ 
uses function prototypes to define and declare functions. This will be explained in the 
next section. 


do { 


»» , 


cout « "Input distance in miles: 
cin » miles; 

cout « "\nDistance is " « convert(miles) « " km." 
« endl; 

} while (miles > 0); 


The program repeatedly prompts the user for a distance in miles. The program is termi¬ 
nated by a zero or negative value. The value placed in the standard input stream is 
automatically converted to an integer value assigned to mi 1 es. 
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13.3 Functions 

The syntax of functions in C++ inspired the new function prototype syntax found in 
Standard C compilers. Basically, the types of parameters are listed mside the header 
parentheses. By explicitly listing the type and number of arguments, strong type check¬ 
ing and assignment-compatible conversions are possible in C++. 

C++ allows functions to have arguments directly called by reference. Call-by-refer¬ 
ence parameters are declared using the syntax 

type& identifier 

Also, C++ function parameters can have default values. These are given in the function 
declaration inside the parameter list by 

= expression 

added after the parameter. 

The following example illustrates these points: 

In file add3.cpp 

// Use of a default value 
#include <iostream.h> 

inline void add3(int& s, int a, int b, int c = 0) 

{ 

s = a + b + c; 

} 


inline double average(int s) { return s / 3.0; } 
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int mainC) 

{ 

int score_l, score_2, score_3, sum; 

cout « "\nEnter 3 scores: 
c in » score_l » score_2 » score_3; 
add3(sum, score_l, score_2, score_3); 
cout « "\nSum = " « sum; 

"\nAverage = " « average(sum) « endl; 
add3Csum ( 2 * score.l, score_2); // use of default value 0 
cout << ^\nWeighted Sum = 1 « sum << 11 , ,T j 

cout « '\nWeighted Average = " « average(sum) « ”.\n"; 
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Dissection of the add.3 Program 

■ inline void add3(int& s, int a, int b, int c = 0) 

{ 

s = a + b + c; 

} 

The variable s is call-by-reference. An actual argument passed in must be an lvalue,; 
Decause that will be the actual address used when the procedure is called. 

a add3(sum, score_l, score_2, score_3); 

The variable sum is passed-by-reference. Therefore, it is directly manipulated and can 
be used to obtain a result from the function’s computation. 

b add3(sum, 2 * score_l, score_2); // use of default value 0 
deTaultf tVvakie zer^ 1 aVgUmGntS are used in callin § add3(). The fourth argument 
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13.4 

What is novel about C++ is its aggregate type cl ass. A cl ass is an extension of the idea 
|of struct in traditional C. A class provides the means for implementing a user- 
defined data type and associated functions and operators. Therefore, a class can be 
used to implement an ADT. Let us write a cl ass called stri ng that will implement a 
restricted form of string. 

In file my_string.cpp 

// An elementary implementation of type string. 

#include <string.h> 

^include <iostream.h> 

const int max_len = 255; 
class string { 

public: // universal access 

void assign(const char* st) 

{ strcpyCs, st); len = strlen(st); > 
int lengthO { return len; } 

void printO { cout « s « "\nLength: " « len « "\n"; } 
private: // restricted access to member functions 

char s[max_len]; // implementation by character array 
int len; 

}; 

Two important additions to the structure concept of traditional C are found in this 
example: first, it has members that are functions, such as assign, and second, it has 
both public and private members. The keyword public indicates the visibility of the 
members that follow it. Without this keyword, the members are private to the class. Pri¬ 
vate members are available for use only by other member functions of the class. Public 
members are available to any function within the scope of the class declaration. Privacy 
allows part of the implementation of a class type to be “hidden.” This restriction pre¬ 
vents unanticipated modifications to the data structure. Restricted access, or data hid¬ 
ing, is a feature of object-oriented programming. 

The declaration of member functions allows the ADT to have particular functions act 
on its private representation. For example, the member function 1 ength returns the 
length of the string defined to be the number of characters up to but excluding the first 
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zero value character. The member function print() outputs both the string and its 
length. The member function assign() stores a character string into the hidden \ 
able s and computes and stores its length in the hidden variable 1 en. 

We can now use this data type stri ng as if it were a basic type of the language, 
obeys the standard block structure scope rules of C. Other code that uses this type is 
client. The client can use only the public members to act on variables of type stri ng. 

// Test of the class string. 


int main() 

{ 

string one, two; 
char three[40] 


{"My name is Charles Babbage."}; 


one.assign("My name is Alan Turing."); 
two.assign(three); 
cout « three; 

cout « "\nLength: " « strlen(three) « end!; 
// Print shorter of one and two. 
if (one.length() <= two.length()) 
one.printQ; 
else 

two.print(); 


The variables one and two are of type stri ng. The variable three is of type pointer 
to char and is not compatible with stri ng. The member functions are called using the 
dot operator or “structure member operator.” As is seen from their definitions, these 
member functions act on the hidden private member fields of the named variables. One 
cannot write inside mai n the expression one. 1 en expecting to access this member. The 
output of this example program is 

My name is Charles Babbage. 

Length: 27 

My name is Alan Turing. 

Length: 23 
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13.5 Overloading 

The term overloading refers to the practice of giving several meanings to an operator or 
a function. The meaning selected depends on the types of the arguments used by the 
operator or function. Let us overload the function pri nt in the previous example. This 
will be a second definition of the pri nt function. 

class string { 

public: // universal access 


void printQ { cout « s « "\nLength: " « len « "\n"; } 
void print(int n) 

{ 

for(int i =0; i < n; ++i) 
cout « s « endl; 

} 

* 4 * * * 

} 

This version of print takes a single argument of type int. It will print the string n 
times. 

three.print(2); // print string three twice 

three.print(-l); // string three is not printed 

It is also possible to overload most of the C operators. For example, let us overload + 
to mean concatenate two strings. To do this we need two new keywords: f ri end and 
operator. The keyword operator precedes the operator token and replaces what 
would otherwise be a function name in a function declaration. The keyword friend 
gives a function access to the private members of a class variable. A f ri end function is 
not a member of the class but has the privileges of a member function in the class in 
which it is declared. 















































e operator + 


h> 

n. h> 

= 255; 


1 st char* st) { strcpy(s, st); len = strlen(st) 
[ return len; } ' 

cout <<c s << M \nLength: " « len << endl; } 
)perator+(const strings a, const string& b); 


onst strings a, const strings b) // overload 


); 

n + b.len; 
max__l en) 
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int main() 

{ 

string one, two, both; 

char three[40] = ("My name is Charles Babbage."}; 

one.assign("My name is Alan Turing."); 
two.assign(three); 

print(three); // file scope print called 

// Print shorter of one and two. 
if (one.lengthQ <= two.lengthO) 

one.print(); // member function print called 

el se 

two.print(); 

both = one + two; // plus overloaded to be concatenate 
both.print(); 





Dissection of the operator+ Function 

■ string operator+Cconst string& a, const string^ b) 

Plus is overloaded. The two arguments it will take are both strings. The arguments are 
call-by-reference. Use of const indicates that the arguments cannot be modified. 

■ string temp; 

The function needs to return a value of type stri ng. This local variable will be used to 
store and return the concatenated string value. 

■ temp.assign(a.s); 

temp.len = a.len + b.len; 
if (temp.len < max_len) 
strcat(temp.s, b.s); 

The string a.s is copied into temp.s by calling the strcpyQ library function. The 
lenglh of the resulting concatenated string is tested to see that it does not exceed the 
maximum length for strings. If the length is acceptable, the standard library function 
strcatO is called with the hidden string members temp.s and b.s. The references to 
temp.s, a.s, and b.s are allowed because this function is a f ri end of class stri ng. 








606 


Chapter 1 3 T Moving from C to C+-f 


lipl§W?A 

fe' 

JBBteg 

Tlifetig 


■ cerr « "Max length exceeded in concatenation.\n"; 

The standard error stream rprr io + 

tion takes place. Only the first string wiS beXnTd '™**”** ^ n ° Concate »l 
® return temp; 

The operator was tnven a return timp * __ , 

priate concatenated string. an tem ^ bas bcen assigned the appro 


’ r- ■ ■ mm 




IBM 

iiii 



13.6 Constructors and Destructors 


// An implementation of dynamically allocated strings 


class string { 
public: 

wh r i ngCi ^ ^ "f s = new char[n + 11; len - n- i // r- 
void assign(const char* st) J en “ n > * // constructor 

int ( lInnrhn’r St): len = strl en(st); } 

Tnt lengthO { return len: } 

VOld print() { COUt « 5 « ,f \n[ pnnth- M i 

friend string operator+fconst striS' a ] f n <<: . ; } 

private; +vconst string^ a, const string^ b) ■ 
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HJ A constructor is a member function whose name is the same as the class name. The 
lip keyword new is an addition to the C language. It is a unary operator that takes as an 
P y argument a data type that can include an array size. It allocates the appropriate amount 
Ipi'of memory from free store to store this type, and returns the pointer value that 
8jJ| addresses this memory. In the preceding example, n + 1 bytes would be allocated from 
|§|g free store. Thus, the declaration 

Ilf string a(40), b(100); 

fiffe: 

lit would allocate 41 bytes for the variable a, pointed at by a. s , and 101 bytes for the vari- 
Jif able b, pointed at by b.s. We add 1 byte for the end-of-string value 0. Storage obtained 
pf by new is persistent and is not automatically returned on block exit. When storage 
% return is desired, a destructor function must be included in the class. A destructor is 
jjjjt written as an ordinary member function whose name is the same as the class name pre- 
Pl ceded by the tilde symbol ~ . Typically, a destructor uses the unary operator del ete or 
pf del ete [], another addition to the language, to automatically deallocate storage associ- 
|gj ated with a pointer expression. The deleted is used anytime new typelsize] was 
1§ used for allocation. 

| // Add as a member function to class string. 

|| ~string() { delete []s; } // destructor 

|j| 

|s It is usual to overload the constructor, writing a variety of such functions to accom- 
Jf modate more than one style of initialization. Consider initializing a string with a 
J§ pointer to char value. Such a constructor is 

I string(const char* p) 

I { 

| len = strlen(p); 

s = new char[len + 1]; 

I strcpy(s, p); 

f } 

A typical declaration invoicing this version of the constructor is 

char* str = "I came on foot."; 
string a("I came by bus."), b(str); 


It would also be desirable to have a constructor of no arguments: 


stringO { len = 255; s = new char[255]; } 
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This would be invoked by declarations without parenthesized arguments and would, by 
default, allocate 255 bytes of memory. Now all three constructors would be invoked in 
the following declaration: |j 

string a, b(10), c("I came by horse."); jj 

The overloaded constructor is selected by the form of each declaration. The variable a 
has no parameters and so is allocated 255 bytes. The variable b has an integer parame¬ 
ter and so is allocated 11 bytes. The variable c has a pointer parameter to the literal 
string "I came by horse. " and so is allocated 17 bytes, with this literal string copied 
into its private s member. 


13.7 Object-oriented Programming and Inheritance 

A novel concept in OOP is the inheritance mechanism. This is the mechanism of deriv¬ 
ing a new 7 class from an existing one called the base class. The derived class adds to or 
alters the inherited base class members. This is used to share code and interface and to 
create a hierarchy of related types. 

Hierarchy is a method for coping with complexity. It imposes classifications on 
objects. For example, the periodic table of elements has elements that are gases. These 
have properties that are shared by all elements in that classification, Inert gases are an 
important special class of gases. The hierarchy here is as follows: An inert gas, such as 
argon, is a gas, which, in turn, is an element. This hierarchy provides a convenient way 
to understand the behavior of inert gases. We know they are composed of protons and 
electrons, as this is shared description with all elements. We know they are in a gaseous 
state at room temperature, as this behavior is shared with all gases. We know they do 
not combine in ordinary chemical reactions with other elements, as this is shared 
behavior with all inert gases. 

Consider designing a data base for a college. The registrar must track different types 
of students. The base class we need to develop captures a description of “student.” Two 
main categories of student are graduate and undergraduate. 

Here is the OOP design methodology: 
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OOP Design Methodology 

1 Decide on an appropriate set of types. 

2 Design in their relatedness. 

3 Use inheritance to share code. 

An example of deriving a class is as follows: 

enum support { ta, ra, fellowship, other }; 
enum year { fresh, soph, junior, senior, grad 1, 

class student { 
public: 

studentfchar* nm, int id, double g, year x), 
void print(); 
private: 

int student_id; 

double gpa; 

year y; 

char name[30]; 

}; 

class grad_student: public student { 

public: 

arad_student 

(char* nm, int id, double g, year x, support t, 
char* d, char* th); 
void printO; 
private: 

support s; 

char dept[10]; 

char thesis[80]; 

}; 


In this example, grad_student is the derived class, and student is the base class. The 
use of the keyword pub! i c following the colon in the derived class header means that 
the public members of student are to be inherited as public members of 
qrad_student. Private members of the base class cannot be accessed in the derived 
class. Public inheritance also means that the derived class grad_student is a subtype 

of student. 

An inheritance structure provides a design for the overall system. For example, a 
data base that contained all the people at a college could be derived from the base class 
person. The student-grad_student relation could be extended to extension students, 
as a further significant category of objects. Similarly, person could be the base class lor 

a variety of employee categories. 
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A polymorphic function has many forms. An example in Standard C is the division oper¬ 
ator. If the arguments to the division operator are integral, then integer division is used!® 
However, if one or both arguments are floating-point, then floating-point division is 
used. r f,. 

In C++, a function name or operator is overloadable. A function is called based on its 
signature, defined as the list of argument types in its parameter list. 1 
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// divide behavior determined by native coercions 
// overloading « the shift operator for output 
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In the division expression, the result depends on the arguments being automatically 
coerced to the widest type. So if both arguments are integer, the result is an integer 
division. If one or both arguments are floating-point, the result is floating-point. In the 
output statement, the shift operator « is invoking a function that is able to output an 
object of type a. | 

Polymorphism localizes responsibility for behavior. The client code frequently 
requires no revision when additional functionality is added to the system through ADT- 
provided code improvements. 

In C, the technique for implementing a package of routines to provide an ADT shape 
would rely on a comprehensive structural description of any shape. 
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struct shape { 
enum{CIRCLE, ■ 
double center, 


••■} e_val; 
radius; 




would have all the members necessary for any shape currently drawable in our system, 

plus an enumerator value so that it can be identified. The area routine would then be , 
written as 

double area(shape* s) 

{ Ai 




switch(s -> e_val) { 

case CIRCLE; return (PI * s -> radius * s -> radius); 
case RECTANGLE; return (s -> height * s -> width); 
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Question: What is involved in revising this C code to include a new shape? Answer: An 
additional case in the code body and additional members in the structure. Unfortu¬ 
nately, these would have ripple effects throughout our entire code body. Each routine 
so structured has to have an additional case, even when that case is just adding a label 
to a preexisting case. Thus, what is conceptually a local improvement requires global 
changes. 

OOP coding techniques in C++ for the same problem use a shape hierarchy. The hier¬ 
archy is the obvious one where circle and rectangle are derived from shape. The revi¬ 
sion process is one in which code improvements are provided in a new derived class, so 
additional description is localized. The programmer overrides the meaning of any 
changed routines—in this case, the new area calculation. Client code that does not use 
the new type is unaffected. Client code that is improved by the new type is typically 
minimally changed. 

C++ code following this design uses shape as an abstract base class. This is a class 
containing one or more pure virtual functions. A pure virtual function does not have a 
definition. The definition is placed in a derived class. 

// shape is an abstract base class 

class shape { 
public: 

virtual double area() = 0; // pure virtual function 

}; 

class rectangle: public shape { 
public: 

rectanglefdouble h, double w): height(h), width(w) {} 

double area() { return (height * width); } // overridden 

private: 

double height, width; 

}; 

class circle: public shape { 
public; 

circle(double r): radius(r) {} 

double area() { return ( 3.14159 * radius * radius); } 
private: 

double radius; 

}; 

Client code for computing an arbitrary area is polymorphic. The appropriate area() 
function is selected at run-time. 












13.9 Templates 

C++ uses the keyword template to provide parametric polymorphism. Parametric poly¬ 
morphism allows the same code to be used with respect to different types, where the 
type is a parameter of the code body. The code is written generically to act on cl ass T. 
The template is used to generate different actual classes when class Tis substituted 
for with an actual type. 

An especially important use for this technique is in writing generic container classes. 
A container class is used to contain data of a particular type. Stacks, vectors, trees, and 
lists are all examples of standard container classes. We shall develop a stack container 
class as a parameterized type. 
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shape* ptr_shape; 

*««»■' 

cout « " area = " « ptr_shape -> areaQ; 


Now imagine improving our hierarchy of types by developing a square class. 

class square: public rectangle { 
public: 

squarefdouble h): rectangle(h,h) {} 
double areaQ { return rectangle: :area(); } 

}; 

The client code remains unchanged. This would not have been the case with the 
OOP code. 
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In file stack.epp 

// template stack implementation 

template <class TYPE> 
class stack { 
public: 

stackCint size = 1000) :max_len(size) 

{ s = new TYPE[size]; top = EMPTY; } 

~stack() { delete [Js; } 

void resetO { top = EMPTY; } 

void push(TYPE c) { s[++top] = c; } 

TYPE pop() { return s[top--]; } 

TYPE top_of() { return s[top]; } 
bool emptyO { return top == EMPTY; } 
bool full() { return top == max_len - 1; } 
private; 

enum {EMPTY = -1}; 

TYPE* s; 
int max_len; 
int top; 

}; 

The syntax of the class declaration is prefaced by 
template cclass identifier > 

This identifier is a template argument that essentially stands for an arbitrary type. 
Throughout the class definition, the template argument can be used as a type name. 
This argument is instantiated in the actual declarations. An example of a stack declaia- 

tion using this is 

stack<char> stk_ch; // 1000 element char stack 

stack<char*> stk„str(200); // 200 element char* stack 

stack<complex> stk_cmplx(100); // 100 element complex stack 

This mechanism saves us rewriting class declarations where the only variation would be 
type declarations. 
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When processing such a type, the code must always use the angle brackets as part of 
the declaration. Here are two functions using the stack template: iff 

// Reversing a series of char* represented strings % 

void reverse(char* str[], int n) 

stack<char*> stk(n); // this stack holds char* 

for (int i =0; i < n; ++i) - 

stk.push(str[i]); 'k 

for (i = 0; i < n; ++i) 1 

str[i] = stk.popO; 
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In function reverseO, a stacl«char*> is used to insert n strings and then pops 
in reverse order. 

//Initializing a stack of complex numbers from an array 
void init(complex c[], stack<complex>& stk, n) 

for (int i =0; i < n; ++i) 
stk.push(c[i]); 

} 


In function i ni t(), a stackccompl ex> variable is passed by reference, and n complefi 


numbers are pushed onto this stack. Notice that we used bool in this example. It isjf 
basic type in C++ whose values are true and false. 
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13.10 C++Exceptions 


C++ introduces an exception-handling mechanism that is sensitive to context. The con- / 
text for raising an exception will be a try block. Handlers declared using the keyworljft 
catch are found at the end of a try block. ft 

An exception is raised by using the throw expression. The exception will be handledp 
by invoicing an appropriate handler selected from a list of handlers found immediately!! 
after the handler’s try block. A simple example of all this is JU 
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// stack constructor with exceptions 
stack::stack(int n) 

! throv/cn); // want a positive value 

p = new char[n]; // create a stack of characters 

if (p == 0) //if new returns 0 when it fails 

throw ("FREE STORE EXHAUSTED"); 

} 


void g() 

{ 

try { 

stack a(n), b(n); 

*#■#*» 

} 

catch (int n) { .... } // an incorrect size 

catch (char* error) { ... } // free store exhaustion 

} 

The first throwQ has an integer argument and matches the catch(i nt n) signa¬ 
ture. This handler is expected to perform an appropriate action where an incorrect 
array size has been passed as an argument to the constructor. For example, an error 
message and abort are normal. The second throw() has a pointer to char argument 
and matches the catch (char* error) signature. 

Modern ANSI C++ compilers throw the standard exception bad_alloc if new fails. 
Older systems returned the null pointer value 0, when new failed. 


13.11 Benefits of Object-oriented Programming 

The central element of OOP is the encapsulation of an appropriate set of data types and 
their operations. The class construct, with its member functions and data members, 
provides an appropriate coding tool. Class variables are the objects to be manipulated. 

Classes also provide data hiding. Access privileges can be managed and limited to 
whatever group of functions needs access to implementation details. This promotes 
modularity and robustness. 

Another important concept in OOP is the promotion of code reuse through the inher¬ 
itance mechanism. This is the mechanism of deriving a new class from an existing one 
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Summary 

1 The double slash // is a new comment symbol. The comment runs to the end of the 

line. The old C bracketing comment symbols /* */ are still available for multiline 

comments. 

2 The iostream.h header introduces I/O facilities for C++. The identifier cout is the 
name of the standard output stream. The operator « passes its argument to stan¬ 
dard out. Used in this way, the «is referred to as the put to operator. The identifier 
cin is the name of the standard input stream. The operator » is the input opera¬ 
tor, called get from , that assigns values from the input stream to a variable. 

3 C++ reduces C’s traditional reliance on the preprocessor. Instead of using defi ne, 
special constants are assigned to variables specified as const. The new keyword 
i nl i ne specifies that a function is to be compiled inline to avoid function call over¬ 
head. As a rule, this should be done sparingly and only on short functions. 

4 The syntax of functions in C++ inspired the new function prototype syntax found in 
Standard C compilers. Basically, the types of parameters are listed inside the header 
parentheses—for example, void add3(int&, int, int, int). Call-by-reference 
is available as well as default parameter values. By explicitly listing the type and 
number of arguments, strong type checking and assignment-compatible conver¬ 
sions are possible in C++. 

5 What is novel about C++ is the aggregate type cl ass. A cl ass is an extension of the 
idea of struct in traditional C. Its use is a way of implementing a data type and 
associated functions and operators. Therefore, a class is an implementation of an 
abstract data type (ADT). There erne two important additions to the structure con¬ 
cept: first, it includes members that are functions, and second, it employs access 
keywords public, private, and protected. These keywords indicate the visibility 
of the members that follow. Public members are available to any function within the 
scope of the class declaration. Private members are available for use only by other 
member functions of the class. Protected members are available for use only by 
other member functions of the class and by derived classes. Privacy allows part of 
the implementation of a class type to be “hidden.” 

6 The term overloading refers to the practice of giving several meanings to an opera¬ 
tor or a function. The meaning selected will depend on the types of the arguments 
used by the operator or function. 
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7 A constructor is a member function whose job is to initialize a variable of its class. It 
In many cases, this involves dynamic storage allocation. Constructors arc invoked 
any time an object of its associated class is created. A destructor is a member fufjjfl 
tion whose job is to finalize a variable of its class. The destructor is invoked implic- T 
itly when an automatic object goes out of scope. 






8 The central element of object-oriented programming (OOP) is the encapsulation ojlltl 
an appropriate set of data types and their operations. These user-defined types are 
ADTs. The class construct, with its member functions and data members, provid£njy 


an appropriate coding tool. Class variables are the objects to be manipulated. 


an 


a 


9 Another important concept in OOP is the promotion of code reuse through th&lt 
inheritance mechanism. This is the mechanism of deriving a new class from an 
existing one, called the base class. The base class can be added to or altered to crij|§ 
ate the derived class. In this way, a hierarchy of related data types can be created?® 
that share code. This typing hierarchy can be used dynamically by vi rtual func¬ 
tions. Virtual member functions in a base class are overloaded in a derived class. ;!! 
These functions allow for dynamic or run-time typing. A pointer to the base class 
can also point at objects of the derived classes. When such a pointer is used t ftf jj H 
point at the overloaded virtual function, it dynamically selects which version of the 
member function to call. 




sSA' 


. - ' • - - •- 


10 A polymorphic function has many forms. A vi rtual function allows run-time selec¬ 
tion from a group of functions overridden within a type hierarchy. An example in 
the text is the area calculation within the shape hierarchy. Client code for comput-J 
ing an arbitrary area is polymorphic. The appropriate area() function is selected at f 
run-time. 
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11 C++ uses the keyword template to provide parametric polymorphism. Parametric 
polymorphism allows the same code to be used with respect to different types, 
where the type is a parameter of the code body. The code is written generically to 
act on class T. The template is used to generate different actual classes when 
class T is substituted for with an actual type. 

12 C++ introduces an exception-handling mechanism that is sensitive to context. The 
context for raising an exception will be a try block. Handlers declared using the 
keyword catch are found at the end of a try block. An exception is raised by using 
the throw expression. The exception will be handled by invoicing an appropriate 
handler selected from a list of handlers found immediately after the handler’s try 
block. 
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Exercises 

1 Using stream I/O, write on the screen the words 

she sells sea shells by the seashore 
(a) all on one line, (b) on three lines, (c) inside a box. 

2 Write a program that will convert distances measured in yards to distances mea 
sured in meters. The relationship is 1 meter equals 1.0936 yards. Write the program 
to use ci n to read in distances. The program should be a loop that does this calcu¬ 
lation until it receives a zero or negative number for input. 

3 The following program reads in three integers and prints their sum. Observe that 
the expression ! ci n is used to test whether input into a, b, and c succeeded. To exit 
the for loop, the user can type bye or exit, or anything else that cannot be con¬ 
verted to an integer. Experiment with the program so that you understand its 

effects. 

#include <iostream.h> 

int mainO 

{ 

int a, b, c, sum; 

Coutl <X 11 — — — \^H* 1 

"Integers a, b, and c will be summed.\n" 

"\n"; 

for C ; ; ) { ,,. 

cout « "Input a, b, and c: ; 

cin » a » b » c; 

if (!cin) 
break; 

sum = a + b + c; 

cout « ”\n" 

" a + b + c = " « sum « \n 

u.. . 

} 
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4 Some C++ systems provide a “big integer” type. Tn GNU C++, for example, this t vUS 
is called Integer. We will write a program that uses this type to compute factorial 
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#include <assert.h> 
#include <iostream.h> 
#include <Integer.h> 


int main() 

{ 

int 

int 

Integer 


// valid for GNU g++\ 
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cout « "The factorial of n will be computed.\n" 

"\n" 

"Input n: "; 

cin » n; 

assert(cin && n >= 0); 
for (i =2; i <= n; ++i) 
product *= i; 
cout « "\n" 

"factorial(" « n « ") = " « product « "\n 
"\n"; 
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Observe that the program looks similar to a factorial program written in C. How 
ever, when we execute this program, we learn, for example, that fj 

factorial(37) = 13763753091226345046315979581580902400000000 

Using a type such as int or double we cannot generate such large integers, but 
with a big integer type we can. If GNU C++ is available to you, try this program. Oth¬ 
erwise, find out if your C++ system has a big integer type, and if so, write a factorial 
program similar to this one. If your program computes 100 factorial correctly, it 
should end with 24 zeros. Does it? 

> Take a working program, omit each line in turn, and run it through your compiler. 
Record the error messages each such deletion causes. For example, use the code 
given in exercise 3. 

5 Write a program that asks interactively for your name and age and responds with 
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Hello name, next year you will be next_age. 



where next_age is age + 1. 
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7 Write a program that prints out a table of squares, square roots, and cubes. Use 
either tabbing or strings of blanks to get a neatly aligned table. 

1 i * i square root i * i * i 

1 1 1.00000 1 

2 4 1.41421 8 

41 • » % * 

8 The C swapping function is 

void swap(int *i, int *j) 

{ 

int temp; 

temp = *i; 

*i = *j; 

*j = temp; 

} 

Rewrite this using reference parameters and test it. 
void swap(int& i, int& j); 

9 In traditional C, but not in ANSI C or C++, the following code causes an error: 

#include <math.h> 

#incl ude <stdio,h> 

int main() 

printf("%f is the square root of 2.\n", sqrt(2)); 
return 0; 

} 

Explain the reason for this and why function prototypes in C++ avoid this problem. 
Rewrite using iostream.h. 

10 Add to the class stri ng in Section 13.4, “Classes and Abstract Data Types,” on page 
601, a member function reverse. This function reverses the underlying representa¬ 
tion of the character sequence stored in the private member s. 

11 Add to the class stri ng in Section 13.4, “Classes and Abstract Data Types,” on page 
601 , a member function void print(int pos, int k). This function overloads 
printO and is meant to print the k characters of the string starting at position 

pos. 
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Overload the operator * in class stri ng. Its member declaration will be 

string string::operator*(int n); 

The expression s * k will be a string that is k copies of the string s. Check that thi 
does not overrun storage. 

Write a cl ass person that would contain basic information such as name, birth 
date, and address. Derive class student from person. 


Write a cl ass tri angl e that inherits from shape. It needs to have its own area() 
member function. 


15 The function reverse () can be written generically as follows: 

// generic reversal 

template cclass T> 

void reverseCT v[], int n) 

{ 

stack<T> stk(n); 
for (int i =0; i < n; ++i) 
stk.push(v[i]); 
for (i =0; i < n; ++i) 
v[i] = stk.popQ; 


Try this on your system, using it to reverse an array of characters and to reverse an 
array of char*. 


16 (S. damage.) The next three programs behave differently: 
// Function declarations at file scope 


int f(int); 
double f(double) 

double add_f() 

{ 


// overloads f(int) 


} 


return (f(l) + f(1.0)); // f(int) + f(double) 
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Now we place one function declaration internally. 

// Function declaration at local scope 
i nt f(int) ; 
double add_f() 

{ double f(double); // hides f(int) ... 

return (fCl) + f(1.0)); // f(double) + f(double) 

} 

Now we place the other function declaration internally. 

double f(double); 
double add_f() 

f 

i nt f (i nt); . 

return (f(l) + f(1.0)); // What is called here. 

} 

t 

Write some test programs that clearly show the different behaviors. 




















loving from C to Java 


Tills chapter gives an overview of the Java programming language, It also provides an 
introduction to Java’s use as an object-oriented programming language. It is organized 
in the manner of the C++ chapter with a series of programs presented, and the ele¬ 
ments of each program explained. The programs increase in complexity, and the exam¬ 
ples in the later sections illustrate some of the concepts of object-oriented 
programming. It can be read independently of the C++ chapter. 

The examples in this chapter give simple, immediate, hands-on experience with key 
features of the Java language. The chapter introduces the reader to Java I/O, classes, 
inheritance, graphics, threads, and exceptions. Mastery of the individual topics requires 
a thorough reading of a companion book such as Arnold and Gosling, The Java Pro¬ 
gramming Language (Reading, MA: Addison-Wesley, 1996), 

Object-oriented programming is implemented by the class construct. The class 
construct in Java is an extension of struct in C. The later examples in this chapter 
illustrate how Java implements OOP (object-oriented programming) concepts, such as 
data hiding, ADTs, inheritance, and type hierarchies. Java has been designed to be used 
on the World Wide Web. It has special libraries designed for graphics and communica¬ 
tion across the Net. It is designed to run in a machine- and system-independent man¬ 
ner. This means the Java program will execute with the same results on a PC running 
Windows 95 or a workstation running SUN Solaris. It does this by defining its semantics 
completely in terms of a virtual machine. The job for a system that wants to run Java is 
to port the virtual machine. This is a trade-off between portability and efficiency. There 
inevitably is additional overhead in a machine running a simulator of a different archi¬ 
tecture. Some of this inefficiency can be overcome by the use of just-in-time compilers 
or the use of native code written in C that is used where efficiency is crucial. On many 
platforms, it is also possible to employ a direct-to-native code compiler for maximum 
run-time efficiency. 
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14.1 Output 

Programs must communicate to be useful. Our first example is a program that prints 
on the screen the phrase “Java is an improved C.” The complete program is 1 

In file Improved.java 

// A first Java program illustrating output. 

// Title: Improved 

// Author: Richmond Q. Programmer 

class Improved { 

public static void main (String[] args) 

{ 

System.out.printlnC'Java is an improved C."); 

} 

} 

The program prints on the screen 
Java is an improved C. 

This program is compiled using the command javac improved.java-, resulting in a 
code file named Improved.dass. This can be run using the command java Improved. 


[3? 
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Dissection of the improved Program 

■ //A first Java program illustrating output. 

The double slash // is a new comment symbol. The comment runs to the end of the 
line. The old C bracketing comment symbols /* */ are still available for multiline 

comments. Java also provides /** */ bracketing comment symbols for a doc comment. 
A program javadoc uses doc comments and generates an HTML file. 



■ class Improved { 

java programs are classes. A class has syntactic form that is derived from the C 
struct, which is not in Java. In Java, class identifier names, such as Improved, are by 
convention capitalized. Data and code are placed within classes. 

■ public static void main (String[] args ) 

When a class is executed as a program, it starts by calling the member function mai n(). 
In this case, mai n() is a member of Improved. In java, command line arguments are 
passed in an array of Stri ng’s. In C, we need an argc variable to tell the program the 
number of command line arguments. In Java, this array length is found by using 
args.length. 


■ System.out.println("Java is an improved C."); 


This statement prints to the screen. The System. out object uses the member function 
pri ntl n() to print. The function prints the string and adds a new line, which moves 
the screen cursor to the next line. Unlike pri ntf () in C, pri ntl n() does not use for¬ 
mat controls. 



In Java, all functions are contained in classes. In this case, the function main() is a 
member of cl ass Improved. A member function is called a method. 


14.2 Variables and Types 

We will write a program to convert to kilometers the distance in miles from the Earth to 
the moon. In miles this distance is, on average, 238,857 miles. This number is an inte¬ 
ger. To convert miles to kilometers, we multiply by the conversion factor 1.609, a real 
numb cr 

Our conversion program will use variables capable of storing integer values and real 
values. The variables in the following program will be declared in main(). Java cannot 
have variables declared as extern (in other words, as global or file scope variables). 

The primitive types in a Java program can be boolean, char, byte, short, i nt, long, 
float, and double. These types are always identically defined regardless of the 
machine or system they run on. For example, the i nt type is always a signed 32-bit inte- 
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ger, unlike in C, where this can vary from system to system. The boolean type is not an 
arithmetic type and cannot be used in mixed arithmetical expressions. The char type 
use 16-bit Unicode values. The byte, short, i nt , and 1 ong are all signed integer types, 
whose length in bits is 8, 16, 32, and 64 respectively. Unlike in C, unsigned types are 
not provided. The floating types comply with IEEE754 standards and are f 1 oat, a 32-bit 
size, and doubl e, a 64-bit size. The non-primitive types are class types and array types, 
and variables of these types take references as their value. 

In file Moon.java 

// The distance to the moon converted to kilometers. 

// Title: moon 

public class Moon { 

public static void main(String[] s) { 
int moon = 238857; 
int moon„kilo; 

System.out.println("Earth to moon = " + moon + " mi."); 
moon_kilo = (int)(moon * 1.609); 

System, out. println("Kilometers = " + moonjdlo + ,, km. n ); 

} 

The output of the program is 

Earth to moon = 238857 mi. 

Kilometers = 384320 km. 



Dissection of the Moon Program 


n int moon = 238857; 

Variables of type int are signed 32-bit integers. They can be initialized as in C. 

■ System.out.println("Earth to moon = " + moon + " mi."); 

The pri ntl n() method can discriminate among a variety of simple values without 
needing additional formatting information. Here, the value of moon will be printed as an 
integer. The symbol + represents string concatenation. Using “plus” pri ntl n() can 
print a list of arguments. What is happening is that each argument is converted from its 
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specific type to an output string that is concatenated together and printed along with a 
newline. 


u moon_kilo = (int)(moon * 1.609); 


The mixed expression moon * 1.609 is a doubl e. It must be explicitly converted to 
i nt. 



Note that narrowing conversions that are implicit in C are not done in Java. 


14.3 Classes and Abstract Data Types 

What is novel about Java is its aggregate type cl ass. A cl ass is an extension of the 
idea of struct in traditional C. A cl ass provides the means for implementing a user- 
defined data type and associated functions. Therefore, a cl ass can be used to imple¬ 
ment an ADT. Let us write a cl ass called Person that will be used to store information 
about people. 

In file Person.java 

// An elementary implementation of type Person. 

class Person { 

private String name; 
private int age; 

private char gender; //male == *M* , female == ? F ! 

public void assignName(String nm) { name = nm; } 
public void assignAge(int a) { age = a; } 
public void assignGender(char b) { gender = b; } 
public String toString( 

return (name + " age is " + age + 

" sex is " + gender ); 
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Two important additions to the structure concept of C are found in this example-; 
first, it has members called class methods that are functions, such as Assi gnAge(), and 
second, it has both public and private members. The keyword publ i c indicates the vis¬ 
ibility of the members that follow it. Without this keyword, the members are private to 
the class. Private members are available for use only by other member functions of the 
class. Public members are available anywhere the class is available. Privacy allow r s part 
of the implementation of a class type to be “hidden.” Tins restriction prevents unantici¬ 
pated modifications to the data structure. Restricted access, or data hiding, is a feature 
of object-oriented programming. 

The declaration of methods inside a class allows the ADT to have actions or behav¬ 
iors that can act on its private representation. For example, the member function 
toStringQ has access to private members and gives Person a string representation 
used in output. This method is common to many class types. 

We can now use this data type Person as if it were a basic type of the language. Other 
code that uses this type is a client. The client can use only the public members to act on 
variables of type Person. 

//PersonTest.java uses Person. 

public class PersonTest { 

public static void main (String!] args ) 

{ 

System.out.println("Person test:"); 

Person pi = new PersonO; //create a Person object 
pi.assignAge(20); 
pl.assignName("Alan Turing"); 
pl.assignCender(false); 

System.out. println(pl.toStringO) ; 


The output of this example program is 
Person test: 

Alan Turing age is 20 sex is M 

Notice the use of new PersonO to create an actual instance of Person. The new opera¬ 
tor goes off to the heap, much as mal 1 oc() does in C and obtains memory for creating 
an actual instance of object Person. The value of pi is a reference to this object. In 
effect, this is the address of the object. 
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14.4 Overloading 

The term overloading refers to the practice of giving several meanings to a method. The 
meaning selected depends on the types of the arguments passed to the method called 
the method’s signature. Let us overload the function assi gnGenderQ in the previous 
example. This will be a second definition of the assignGenderO method. 

class Person { 

public void assignGender(char b) { gender = b; } 
public void assignGender(String b) 

{ gender = ((b == "M")? 'M': ’F'); } 

} 

This version of assignGenderO takes a single argument of type String. It will con¬ 
vert and store this properly as a gender character value. Now a user can use either a 
char or a Stri ng value in assigning gender. 


14.5 Construction and Destruction of Class Types 

A constructor is a function whose job is to initialize an object of its class. Constructors 
are invoked after the instance variables of a newly created class object have been 
assigned default initial values and any explicit initializers are called. Constructors are 
frequently overloaded. 

A constructor is a member function whose name is the same as the class name. The 
constructor is not a method and does not have a return type. Let us change our Person 
example to have constructors to initialize the name instance variable. 

//constructor to be placed in Person 

public PersonO {name = "Unknown";} 
public Person(String nm) { name =nm;} 
public Person(String nm, int a, char b) 

{ name =nm; age =a; gender = b;} 
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These would be invoked when new gets used to associate a created instance with the 
appropriate type reference variable. For example, 








pi = new Person(); 

pi = new Person("Laura Pohl"); 

pi = new Person("Laura Pohl" 9, ’F') 




-M 


//creates "unknown 0 M 
//creates Laura Pohl 0 M 
//creates Laura Pohl 9 F1BP 








The overloaded constructor is selected by the set of arguments that matches the con/tl 
structors parameter list. 

. m mmi 

Destruction is done automatically by the system using automatic garbage collection, M 
When the object can no longer be referenced, for example, when the existing referenc|81i 
is given a new object, the now inaccessible object is called garbage. Periodically, the sys- sflj 
tern sweeps through memory and retrieves these “dead” objects. The programmer needJJ 
not be concerned with such apparent memory leaks. 




- 

In' 






14.6 Object-oriented Programming and Inheritance 


* - 


IBS 


In Java, inheritance is the mechanism of extending a new' class from an existing one 
called the superclass. The extended class adds to or alters the inherited superclass 
methods. This is used to share interface and to create a hierarchy of related types. 

Consider designing a data base for a college. The registrar must track different types 
of students. The superclass w ; e start with will be Personl. This class will be identical to 
Person, except that the private instance variables will be changed to have access pro¬ 
tected. This access allows their use in the subclass, but otherwise acts like private. 

Here is an example of deriving a class: 


*1 















MM™ 

. .'.*.»• 

,S 4 
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// Note Personl is Person with private instance variables 
// made protected 

class Student extends Personl { 
private String college; 

private byte year; //l=fr, 2= so, 3 = jr, 4=sr 

private double gpa; //0.0 to 4.0 

public void assignCollege(String nm) { college = nm; } 

public void assignYearfbyte a) { year = a; } 

public void assignGpafdouble g) { gpa = g; } 

public String toStringQ % , 

{ return (super.toStringQ + " College is " + college); } 

public StudentC) 

{super.assignName("Unknown"); college = "Unknown ;} 
public Student(String nm) 

{ super(nm); college = "Unknown"; } 
public Student(String nm, int a, char b) 

{ name =nm; age =a; gender = b; } 

>; 

In this example, Student is the subclass, and Personl is the superclass. Notice the use 
of the keyword super. It provides a means of accessing the instance variables or meth¬ 
ods found in the superclass. 

The inheritance structure provides a design for the overall system. The superclass 
Personl leads to a design where the subclass Student is derived from it. Other sub¬ 
classes such as GradStudent or Employee could be added to this inheritance hierarchy. 


14.7 Polymorphism and Overriding Methods 

In Java, polymorphism comes from both method overloading and method overriding. 
Overloading has already been discussed. Overriding occurs when a method is redefined 
in the subclass. The toStri ng() method is in Personl and is redefined in Student 

extended from Personl. 
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//Overriding the printName() method 
class Personl { 

protected String name; 
protected int age; 

protected char gender; //male 

public toStringQ { 

return(name + " age is " + age + 

" sex is " + (gender == 'F' 

} 


female == 


i r i 


"M") ); 


class Student extends Personl { 
private String college; 
private byte year; 
private double gpa; 
public toStringQ 

{ return(super.toString() + 


//0.0 to 4,0 

" College is " + college); } 


■ . K# 


'.'s'IS 


// ‘-A 

■. * \ 




- 

The overridden method toStringO has the same name and signature in both the 
superclass Personl and the subclass Student. Which one gets selected depends at 
runtime on what is being referenced. For example, in the code i ll s 


//StudentTest.java use Personl 

public class StudentTest { 

public static void main (String[] args ) 

{ 

Personl ql; 

ql = new StudentO; 

ql,assignName("Charles Babbage"); 

System.out.println(ql.toString()); 

ql = new PersonlQ; 

ql.assignName("Charles Babbage"); 

System.out,println(ql.toString()); 


/v\ w- - • -.w.d 

• c, .. 

... 

A-': ii 






wi,/.-.; 


... ... . 


- I 


The variable ql can refer to either Personl object or the subtype Student object. At 
runtime the correct toStri ng() will be selected. The assignName() method is known 
at compile time since it is the superclass Personl method. 
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14.8 Applets 

lava is known for providing applets on Web pages. A browser is used to display and 
execute the applet. Typically, the applet provides a graphical user interface to the code. 
The next piece of code will be an applet for computing the greatest common divisor for 

two numbers. 

//GCD applet implementations 

import java.applet.*; //gets the applet superclass 
import java.awt.*; //abstract windowing toolkit 

import java.io.*; 

//derived from the class Applet 

public class wgcd extends Applet { 
int x, y, z, r; 

TextField a = new TextField(10); //input box 

TextField b = new TextField(10); //input box 

TextField c = new TextField(10); //output box 

Label 11 = new Label("Valuel: "); 

Label 12 = new Label ("Value2: "); 

Button gcd = new ButtonC GCD: "); 

//draws the screen layout such as the TextFields 

public void init() { 

setLayout(new FlowLayoutQ); 
c.setEditable(false); 
add(ll); add(a); 
add(12); add(b); 
add(gcd); add(c); 

} 

//computes the greatest common divisor 

public int gcd(int m, int n) { 
while (n !=0) { 
r = m % n; 
m = n; 
n = r; 

} 

return m; 

> 
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//looks for screen events to interact with 

public boolean action(Event e, Object o) { 
if (" GCD: ".equals(o)) { //press button 
x = Integer.parselnt(a.getText()); 
y = Integer. parselnt(b. getTextO) ; 
z = gcd(x,y); 

//place answer in output TextField 
c.setText(Integer.toString(z)); 


return true; 


The code uses the graphics library awl: and the applet class to draw an interactive inter¬ 
face that can be executed either by a special program called the appletvlewer or by a 
Java-aware browser such as Microsoft Explorer or Netscape Navigator. Unlike ordinary 
Java programs, this program does not use a mai n() method to initiate the computation. 
Instead, the initQ method draws the screen. Further computation is event-driven and 
processed by the actionQ method. The user terminates the applet by clicking on the 
Quit command in the applet pull-down menu. 


14.9 Java Exceptions 


Java has an exception-handling mechanism that is integral to the language and is 
heavily used for error detection at runtime. It is similar to the one found in C++. An 
exception is thrown by a method when it detects an error condition. The exception will 
be handled by invoicing an appropriate handler selected from a list of handlers called 
catches. These explicit catches occur at the end of an enclosing try block. An uncaught 
exception is handled by a default Java handler that issues a message and terminates the 
program. An exception is itself an object, which must be derived from the superclass 
Throwable. As a simple example of all this, we will add an exception NoSuchNameEx- 
ception to our Person example class. 

class NoSuchNanieException extends Exception { 
public String str() { return name; } 
public String name; 

NoSuchNameException(String p) { name = p; } 

}; 
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The purpose of this exception is to report an incorrect or improperly formed name. 
In many cases, exceptions act as assertions would in the C language. I hey determine 
whether an illegal action has occurred and report it. We now modify the Person code to 
take advantage of the exception. 

//Person2.class: Person with exceptions added 

class Person2 { 

private String name; 

public Person2(String pjthrows NoSuchNameException { 
if ( p == •'") 

throw new NoSuchNameException(p); 
name = p; } 

public String toString(){ return name;} 
public static void main(String[] args) 
throws NoSuchNameException 

{ 

try{ 

Person2 p = new Person2("ira pohl"); 

System.out.pri ntl nC'PERSONS"); 

System.out.println(p.toString()); 
p = new Person2(""); 


+ t.strO); } 


catch(NoSuchNameException t) 

{ System.out.println(" exception with name 
fi nally 

{ System.out.println("finally clause"); } 


The throwO has a NoSuchNameException argument and matches the catch() sig¬ 
nature. This handler is expected to perform an appropriate action where an incorrect 
name has been passed as an argument to the Person2 constructor. As in this example, 
an error message and abort are normal. The f i nal 1 y clause is shown here. It is code 
that is done regardless of how the try block terminates. 
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14.10 Benefits of Java and OOP 

Java shares with C++ the use of classes and inheritance to build software in an object- 

oriented manner. Also, both use data hiding and have methods that are bundled within 
the class. 

Unlike C++, Java does not allow for conventional programming. Everything is encap¬ 
sulated in some class. This forces the programmer to think and design everything as an 
object. The downside is that conventional C code is not as readily adapted to Java as it 
is to C++. Java avoids most of the memory pointer errors that are common to C and 
C++. Address arithmetic and manipulation are done by the compiler and system— not 
the programmer. Therefore, the Java programmer writes safer code. Also, memory rec¬ 
lamation is automatically done by the Java garbage collector. 

Another important concept in OOF is the promotion of code reuse through the inher¬ 
itance^ mechanism. In java, this is the mechanism of extending a new class called a sub¬ 
class lrom an existing one called the superclass. Methods in the extended class override 

the superclass methods. The method selection occurs at runtime and is a highly flexible 
polymorphic style of coding. 

java, in a strict sense, is completely portable across all platforms that support it. Java 
is compiled to byte code that is run on the Java virtual machine. This is typically an 
interpreter—code that understands the Java byte code instructions. Such code is much 
slower than native code on most systems. The trade-off here is between universally 
consistent behavior versus loss of efficiency. 

Java has extensively developed libraries for performing Web-based programming. It 
has the ability to write graphical user interfaces that are used interactively. It also has a 

thread package and secure web communication features that let the coder write distrib¬ 
uted applications. 

java is far simpler than C++ in the core language and its features. In some ways, this 
is deceptive in that much ol the complexity is in its libraries. Java is far safer because of 
very strict typing, avoidance of pointer arithmetic, and well-integrated exception han¬ 
dling. It is system-independent in its behavior, so one size fits all. This combination of 

OOP, simplicity, universality, and Web-sensitive libraries makes it the language of the 
moment. 
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Summary 

1 The double slash // is a new comment symbol. The comment runs to the end of 

the line. The old C bracketing comment symbols /* */ are still available for 

multiline comments. Java also provides /** V bracketing comment symbols for 
a doc comment. A program javadoc uses doc comments and generates an HTML 
file. 

2 Java programs are classes. A class has syntactic form that is derived from the C 
struct, which is not in Java. Data and code are placed within classes. When a 
class is executed as a program, it starts by calling the member function mai n(). 

3 The term overloading refers to the practice of giving several meanings to a 
method. The meaning selected depends on the types of the arguments passed to 
the method called the method’s signature. 

4 A constructor is a function whose job is to initialize an object of its class. Con¬ 
structors are invoked after the instance variables of a newly created class object 
have been assigned default initial values and any explicit initializers are called. 
Constructors are frequently overloaded. Destruction is done automatically by the 
system using automatic garbage collection, 

5 Inheritance is the mechanism of extending a new class from an existing one 
called the superclass. The extended class adds to or alters the inherited super¬ 
class methods. This is used to share interface and to create a hierarchy of related 
types. 

6 In Java, polymorphism comes from method overloading and method overriding. 
Overriding occurs when a method is redefined in the subclass. The selection of 
the appropriate overridden method definition is decided at runtime depending 
on the object’s type. 

7 Java is known for providing applets on Web pages. A browser is used to display 
and execute the applet. Typically, the applet provides a graphical user interface 
to the code. In applets, an acti on() method picks up an event, such as a mouse 
click, and decides on a next action. 

8 An exception is thrown by a method when it detects an error condition. The 
exception will be handled by invoicing an appropriate handler selected from a list 
of handlers called catches. These explicit catches occur at the end of an enclos¬ 
ing try block. An uncaught exception is handled by a default Java handler that 
issues a message and terminates the program. 
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9 Unlike C++, Java does not allow for conventional programming. Everything is 
encapsulated in some class. This forces the programmer to think and design 
everything as an object. The downside is that C code is not as readily adapted to 
Java as it is to C++. The upside is that the Java programmer writes safer code. 


Exercises 


1 Using Java I/O, output (a) all on one line, (b) on three lines, (c) inside a box: 

she sells sea shells by the seashore 

2 Write a program that will convert distances measured in yards to distances mea¬ 
sured in meters. The relationship is 1 meter equals 1.0936 yards. If you can, write 
the program to read in distances; otherwise, do it by simple assignment to an 
instance variable inside the method mai n (). 

3 Write an applet that asks interactively for your name and age and responds with 

Hello name, next year you will be nexLage. 
where next_age is age + 1. 

4 Write a program that prints out a table of squares, square roots, and cubes. Use 
either tabbing or strings of blanks to get a neatly aligned table. 


1 * 


square root 
1.00000 


i * i * *j 


I «r * « c 


5 Write a class that can perform complex arithmetic. Unlike C++, in Java you cannot 
overload operators. Write method Comp! ex. pi us(Compl ex) and method Com¬ 
plex, minus (Comp! ex), such that they return an appropriate Complex result. 

6 Write a class GradStudentO that extends Student. Add to Student additional 
information that includes the graduate student’s thesis topic and routines for 
assigning to this part of the class and printing out this information. 

7 Add exceptions to the class GradStudent so that improperly initializing Grad- 
Student objects wall result in a run-time exception. 


Appendix A 

The Standard Library 


The standard library provides functions that are available for use by the programmer. 
Associated with the library are standard header files provided by the system. These 
header files contain prototypes of functions in the standard library, macro definitions, 
and other programming elements. If a programmer wants to use a particular function 
from the library, the corresponding header file should be included. Here is a complete 
list of the header files: 


C header files 


<assert,h> 
cctype.h> 
cerrno.h> 
<float.h> 


<1 i mi ts. h> 
<1ocale«h> 
<math.h> 
<setjmp,h> 


<signal.h> 
<stdarg.h> 
<stddef.h> 
<stdio.h> 


<stdlib.h> 

<string.h> 

<time.h> 


These files may be included in any order. Also, they may be included more than once, 
and the effect will he the same as if they were included only once. In this appendix, we 
organize our discussion by header file. 


A.1 Diagnostics: <assert.h> 


This header file defines the assert () macro. If the macro NDEBUG is defined at the 
point where <assert. h> is included, then all assertions are effectively discarded. 
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■ void assert(int expr); 

If expr is zero (false), then diagnostics arc printed and the program is aborted. The 
diagnostics include the expression, the file name, and the line number in the file. 


A.2 Character Handling: <ctype.h> 

This header defines several macros that are used to test a character argument. In addi¬ 
tion, there are function prototypes for two functions used to map a character argu¬ 
ment. 


Testing a Character 


int isalnuni(i 
int isalpha(i 
int iscntr1(i 
int isdigit(i 
int isgraph(i 
int islowerfi 
int isprint(i 
int ispunctCi 
int isspaceCi 
int isupper(i 
int isxdigitf 


nt c); 
nt c); 
nt c); 
nt c); 
nt c); 
nt c); 
nt c); 
nt c) ; 
nt c) ; 
nt c) ; 
int c); 


/* is 
/* is 


alphanumeric */ 
alphabetic */ 
control */ 
digit: 0-9 */ 
graphic */ 
lowercase */ 
printable */ 
punctuation */ 
white space */ 
uppercase */ 

hex digit 0-9, a-f, A-F */ 


These character tests are typically implemented as macros; see ctype.h in your installa¬ 
tion for details. If the argument c satisfies the test, then a nonzero value (true) is 
returned; otherwise, zero (false) is returned. These macros should also be available as 
functions. 

The printing characters are implementation-defined, but each occupies one printing 
position on the screen. A graphic character is any printing character, except for a space. 
Thus, a graphic character puts a visible mark on a single printing position on the 
screen. A punctuation character is any printing character other than a space or a char¬ 
acter c for which isalnum(c) is true. The standard white space characters are space, 
form feed (' \f'), newline (' \n'), carriage return (' \r'), horizontal tab (' \t'), and ver¬ 
tical tab C\v'). The control characters are the audible bell (' \a'), backspace (’ \b'), 
any character c for which isspace(c) is true other than space, and control-c, control- 
h, and so on. 
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Mapping a Character 

The two functions tolowerQ and toupperQ are used to map a character argument. 
Caution: Early versions of many ANSI C compilers did not implement these functions 
correctly. 

m int tolower(int c); 

If c is an uppercase letter, the corresponding lowercase letter is returned; otherwise, c 
is returned. 

n int toupperfint c); 

If c is a lowercase letter, the corresponding uppercase letter is returned; otherwise, c is 
returned. 

The next three macros often occur on ASCII machines. The first two are related to, 
but not the same as, tolowerO and toupperO. 

#define _tolower(c) ((c) + ’a' - 'A') 

#define _toupper(c) ((c) + 'A' - 'a') 

#define toascii(c) ((c) & 0x7f) 

The hexadecimal constant 0x7f is a mask for the low-order 7 bits. 


A.3 Errors: <errno.h> 

The identifier errno is defined here, along with several macros that are used to report 
error conditions. 

extern int errno; 

Typically, there are lots of macros in errno.h. Which macros occur is system-depen¬ 
dent, but all names must begin with E. Various library functions use these macros for 
error reporting. 

Two macros are common to all systems. These are used by the mathematical func¬ 
tions in the library: 


#define EDOM 33 

#define ERANGE 34 


/* domain error */ 
/* range error */ 
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Values other than 33 and 34 could be used here, but these values are typical. 

The domain of a mathematical function is the set of argument values for which it is 
defined. For example, the domain of the square root function is the set of all nonnega¬ 
tive numbers. A domain error occurs when a mathematical function is called with an 
argument not in its domain. When this happens, the system assigns the value EDOM to 
errno. The programmer can use perrorQ and strerrorQ to print a message associ¬ 
ated with the value stored in errno. > 

A range error occurs when the value to be returned by the function is defined mathe¬ 
matically but cannot be represented in a double. When this happens, the system 
assigns the value ERANGE to errno. 


A.4 Floating Limits: <float .h> 

Macros that define various floating characteristics and limits are defined here. There 
are many of them. Some examples are 


#define 
#define 
#define 


DBI_MAX 

FLT_MAX 
LDBI_MAX 


1.7976931348623157e+308 

3.40282347e+38F 

1.7976931348623157e+308 


#define 
#define 
#define 


DBI_MIN 

FLT_MIN 

LDBL_MIN 


2.2250738585072014e-308 
1.17549435e-38F 
2.2250738585072014e-308 


#define 
#define 
#define 


DBI_EPSILON 

FLT_EPSILON 
LDBI_EPSILON 


2.2204460492503131e-16 
1.19209290e-07F 
2.2204460492503131e-16 


The constants are system-dependent. We are assuming that a long double is imple¬ 
mented as a double. Not all systems do this. Some provide more precision and range; 
see float.h on your system. 
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A.5 Integral Limits: <1imits.h> 

Macros that define various integral characteristics and limits are defined here. There 
are many of them. Some examples are 

#define CHAR_BIT 8 /* number of bits in a byte */ 

#define CHAR_MAX 127 

#define CHAR_MIN (-128) 

#define SHRT_MAX 32767 

#define SHRT_MIN (-32768) 

#define INT_MAX 2147483647 

#define INT_MIN (-2147483648) 

The constants are system-dependent. 


A.6 Localization: <locale.h> 

This header contains programming constructs that can be used to set or access proper¬ 
ties suitable for the current locale. The following structure type is defined: 

struct Iconv { 

char *decimal_point; 
char *thousands_sep; 
char *currency_symbol; 

}; 

The members allow for local variations, such as using a comma instead of a period for a 
decimal point. At least six symbolic constants are defined. 

#define LC_ALL 1 /* all categories */ 

fdefine LC_COLLATE 2 /*strcoll() and strxfrm */ 

#define LC_CTYPE 3 /^character handling functions */ 

#define LC_MONETARY 4 /* monetary info in localeconv() */ 

#define LC_NUMERIC 5 /^decimal point in lib fcts */ 

#define LC_TIME 6 /*strftime()*/ 
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7^ ues °1the symbolic constants are system-dependent. Other macros besinnina 

ThGSe m3Cr0S Can bCUSCd 3S the -gumentL S 


char '''setlocale(int category, const char *locale) 


“'N S ULL P !! ed ^ the SySMm ' that d “ CTlbeS “* -w localeifttS vS "'X- 
W1Se ' the NULL pomter 18 returned. At program startup, the system behaves as if 


setlocale(LC_ALL, "C"); 


tabeeu executed. This specifies a minimal environment for C translation. The state- 


set!ocale(LC_ALL, 


whichis system-dependent. Using a macro other than 


LC ALL affprK nnh; « a n- 1 r Z 7 u ^ UiUUUt * macro omer 

Of the locale JeanVwith locale - For example, LC_MONETARY affects only that 
or me locate dealmg with monetary information. 


part 


struct 1conv 'localeconv(void); 


i!?!?.!!L t 0 „ a !! rUCt o re provided b >' the s T stem is returned. It is of static duration and 


contains nnmprir i i -- ui otaiu. uuiduan ana 

setl ocal e() f ° Ut ™ rrent Iocale ‘ Further calIs t0 the function 

LJ ati ciiange the values stored in the structure. 


A.7 Mathematics: <math.h> 


So ^ml r s“ne C STdSS eS ** ^ mathematical functions in the library. It 


#define HUCE_VAL 1.7976931348623157e+308 
The value of the macro is system-dependent 


deMed°Adnma- a mathematlcal functi °n is the set of argument values for which it is 
d. A domain error occurs when a mathematical function is called with an argu- 
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ment not in its domain. When this happens, the function returns a system-dependent 
value, and the system assigns the value EDOM to errno. 

A range error occurs when the value to be returned by the function is defined mathe¬ 
matically but cannot he represented in a doubl e. If the value is too large in magnitude 
(overflow), then either HUGE_VAL or -HUGE_VAL is returned. If the value is too small in 
magnitude (underflow), zero is returned. On overflow, the value of the macro ERANGE is 
stored in errno. What happens on underflow is system-dependent. Some systems store 
ERANGE in errno; others do not. 

■ double cos(double x); 
double sin(double x); 
double tan(double x); 

These are the cosine, sine, and tangent functions, respectively. 

■ double acos(double x); /* arccosine of x */ 

double asin(double x); /* arcsine of x */ 

double atan(double x); /* arctangent of x */ 

double atan2(double y, double x); /* arctangent of y/x */ 

These are inverse trigonometric functions. The angle theta returned by each of them is 
in radians. The range of the acos() function is [0, it]. The range of the asinQ and 
atanQ functions is rt/2]. The range of the atan2() function is [-it, tiJ. Its princi¬ 
pal use is to assist in changing rectangular coordinates into polar coordinates. For the 
functions acosQ and asi n(), a domain error occurs if the argument is not in the 
range [-1, 1]. For the function atan2(), a domain error occurs if both arguments are 
zero and y/x cannot be represented. 

■ double cosh(double x); 
double sinh(double x); 
double tanh(double x); 

These are the hyperbolic cosine, hyperbolic sine, and hyperbolic tangent functions, 
respectively. 

■ double exp(double x); 
double log(double x); 
double logl0(double x); 

The exp() function returns e x . The log() function returns the natural logarithm 
(base e) of x. The logl0() function returns the base 10 logarithm of x. For both log 
functions, a domain error occurs if x is negative. A range error occurs if x is zero and 
the logarithm of zero cannot be represented. (Some systems can represent infinity.) 
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■ double ceil(double x); 
double floor(double x); 

The ceiling function returns the smallest integer not less than x. The floor function 
returns the largest integer not greater than x. 

H double Tabs(double x); /* floating absolute value */ 

Returns the absolute value of x. Caution: The related function absQ is designed for 
integer values, not floating values. Do not confuse abs() with fabs(). 

■ double fmod(double x, double y); /* floating modulus */ 

Returns the value x (mod y). More explicitly, if y is nonzero, the value x - i * y is 
leturned, where i is an integer such that the result is zero, or has the same sign as x 
anc magnitude less than the magnitude of y. If y is zero, what is returned is system- 
c ependent, but zero is typical. In this case a domain error occurs on some systems. 

■ double pow(double x, double y); /* power function */ 

Returns x raised to the y power. A domain error occurs if x is negative and y is not an 
integer. 

■ double sqrt(double x); /* square root */ 

Returns the square root of x, provided x is nonnegative. A domain error occurs if x is 
negative. 

■ double frexp(double value, int v exp_ptr); /* free exponent */ 

This is a primitive used by other functions in the library. It splits val ue into mantissa 
and exponent. The statement 

x = frexp(value, &exp); 
causes the relationship 

val tie = x * 2 exp 

to hold, where the magnitude of x is in the interval [1/2, 1] or x is zero. 

■ double ldexp(double x, int exp); /* load exponent */ 

The value x * 2 ex P is returned. 
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■ double modf(double value, double *i_ptr); 

Breaks val ue into integer and fractional parts. The function call modf (val ue, &i) 
returns the value f, and indirectly the value i, so that 

value = i + f 


A.8 Nonlocal Jumps: <setjmp.h> 

This header provides one type definition and two prototypes. These declarations allow 
the programmer to make nonlocal jumps. A nonlocal jump is like a goto, but with the 
flow of control leaving the function in which it occurs. The type definition is system- 
dependent. The following is an example: 

typedef long jmp_buf[16]; 

An array of type jmp_buf is used to hold system information that will be used to 
restore the calling environment. 

■ int setjnip(jmp_buf env) ; 

Saves the current calling environment in the array env for later use by longjmpQ and 
returns zero. Although on many systems this is implemented as a function, in ANSI C it 
is supposed to be implemented as a macro. 

■ void longjmp(jmp_buf env, int value); 

The function call 1 ongimp(env, val ue) restores the environment saved by the most 
recent invocation of setjmp(env). If setjmp(env) was not invoked, or if the function 
in which it was invoked is no longer active, the behavior is undefined. A successful call 
causes program control to jump to the place foliowing the previous call to set- 
jmp(env). If value is nonzero, the effect is as if setjmp(env) were called again with 
val ue being returned. If val ue is zero, the effect is as if set jmp(env) were called 
again with one being returned. 
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A.9 Signal Handling: <signal.h> 


This header contains constructs used by the programmer to handle exceptional condi¬ 
tions, or signals. The following macros are defined in this header: 


#define 

SIGINT 

2 

/* 

#define 



/* 

#define 

SIGFPE 

8 

/* 

fdefine 

SIGSEGV 

11 

/it 

#define 

SIGTERM 

15 

/* 

#define 

SIGABRT 

22 

/it 


interrupt */ 
illegal instruction */ 
floating-point exception */ 
segment violation */ 
asynchronous termination */ 
abort */ 


The constants are system-dependent, but these are commonly used. Other signals are 
usually supported; see the file signal.h on your system. 

The macros in the following set may be used as the second argument of the function 
signal (): 

#define SIG_DFL ((void (*)(int)) 0) /* default */ 

#define SIG_ERR ((void (*)(int)) -1) /* error */ 

#define SIG_IGN ((void (*)(int)) 1) /* ignore */ 

A system may supply other such macros. The names must begin with SIG_ followed by 
a capital letter. 

■ void (*signal(int sig, void (*func)(int)))(int); 

The function call si gnal (si g, func) associates the signal si g with the signal handler 
func(). If the call is successful, the pointer value func of the previous call with first 
argument si g is returned, or NULL is returned if there was no previous call. If the call is 
unsuccessful, the pointer value SIG_ERR is returned. 

The function call signal (sig, func) instructs the system to invoke func(sig) 
when the signal sig is raised. If the second argument to signal () is SIG_DFL, default 
action occurs; if it is SIG_IGN, the signal is ignored. When program control returns 
from func(), it returns to the place where si g was raised. 

■ int raise(int sig); 

Causes the signal sig to be raised. If the call is successful, zero is returned; otherwise, 
a nonzero value is returned. This function can be for testing purposes. 
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A.10 Variable Arguments: <stdarg.h> 

This header file provides the programmer with a portable means of wr ^g functions 
such as printf () that have a variable number of arguments. The header file contain 
one typedef and three macros. How these are implemented is system-depen en , u 


typedef char * vajlist; 


#define 

#define 
#define 


va_start(ap, v) \ . 

((void) (ap = (va_list) &v + sizeof(v))) 

va_arg(ap, type) (*((type ‘")(ap))+ + ) 

va_end(ap) ((void) (ap - 0)) 


In the macro va_start (), the variable v is the last argument that is declared in the 
header to your variable argument function definition. This variable cannot be of storage 
class reqi ster, and it cannot be an array type or a type such as char that is widened 
by automatic conversions. The macro va_start() initializes the argument Pomter ap. 
The macro va_arg() accesses the next argument in the list. The macro va - en ^ () P® 
forms any cleanup that may be required before function exit. The following program 




#include <stdio.h> 
#include <stdarg.h> 

int va_sum(int cnt, ...); 


int main(void) 

{ 

int a = 1, b = 2, c — 3; 

orintf("First call: sum =%d\n", va_sum(2, a, b)); 
printf ("Second call: sum = %d\n", va_sum(3, a, b, O); 

return 0; 

> 
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int va_sum(int cnt, ...) 

{ 

int i, sum = 0; 

va_list ap; 

va_start(ap, cnt); 
for (i =0; i < cnt; ++i) 
sum += va_arg(ap, int); 
va_end(ap); 
return sum: 


/* sum the arguments */ 


/* startup */ 

/* get next argument */ 
/* cleanup */ 


A.l 1 Common Definitions: <stddef,h> 

This header file contains some type definitions and macros that are commonly used in 
other places. How they are implemented is system-dependent, but here is one way of 
doing it: 


typedef 

char 

wchar_t; 


typedef 

i nt 

ptrdiff_t 

m 

typedef 

unsigned 

size_t; 


fdefine 

NULL 

((void *) 

0) 

fdefine 

offsetof(s. 

_type, m) 

\ 


C(size_t) &(((s_type *) 0) -> m)) 

Here, we defined the wide character type wchar_t as a plain char. A system can define 
it to be any integral type. It must be able to hold the largest extended character set of 
all the locales that are supported. The type ptrdi ff_t is the type obtained when two 
pointers are subtracted. The type size_t is the type obtained with use of the sizeof 
operator. A macro call of the form offsetof (s_ type, m) computes the offset in bytes 
of the member m from the beginning of the structure s_type. The following program 
illustrates its use: 
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#include 
#include 


<stdio.h> 
<stddef.h> 


typedef struct { 
double a, b, c, 

} data; 

int main(void) 

{ 

printf("%d %d\n , 

offsetof(data, a), offsetof(data, b)); 
return 0; 


On most systems, this program causes 0 and 8 to be printed. 


A.l2 Input/Output: <stdio.h> 

This header file contains macros, type definitions, and prototypes of functions used by 
the programmer to access files. Here are some example macros and type definitions. 

#define BUFSIZ 1024 /*buf size for all I/O*/ 

fdefine EOF (-1) /*returned on EOF*/ 

#define FILENAME_MAX 255 /*max filename chars >7 

#define FOPEN MAX 20 /*max open files*/ 

fdefine L_tmpnam 16 /*size tmp fi1ename*/ 

#define NULL 0 /*null pointer value */ 

fdefine TMP_MAX 65535 /*max unique filenames*/ 

typedef long pos_t; /*used with fsetposO */ 

typedef unsigned size_t; /*type from sizeof op*/ 

typedef char * vajlist; /*used with vfprintfCW 

The structure type FILE has members that describe the current state of a file. The 
name and number of its members are system-dependent. Here is an example. 
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typedef struct { 

int cnt; 

unsigned char *b_ptr; 
unsigned char *base; 
int bufsize 

short flag; 

char fd; 

} FILE; 


/* size of unused part of buf */ 
/* next buffer loc to access */ 
/* start of buffer */ 

/* buffer size */ 

/* info stored bitwise */ 

/* file descriptor */ 


extern FILE 


-iob[] 


An object of type FILE should be capable of recording all the information needed to 
control a stream, including a file position indicator, a pointer to its associated buffer, 
an error indicator that records whether a read/write error has occurred, and an end-of- 
file indicator that records whether the end-of-file mark has been reached. How this is 
implemented is system-dependent. For example, the error indicator and the end-of-file 
indicator might be encoded bitwise in the structure member flag. 

Typically, the type f pos_t is given by: 

typedef long fpos_t; 

An object of this type is supposed to be capable of recording all the information 
needed to uniquely specify every position in a file. 

Macros are used to define stdi n, stdout, and stderr. Although we t hink of them as 
tiles, they are actually pointers. 


#define 
#define 
#define 


stdi n 

stdout 

stderr 


(&_iob[0]) 

0&_iob[l]) 

(&_iob[2]) 


Unlike other files, stdin, stdout, and stderr do not have to be opened explicitly by 
the programmer. 

A few macros are intended for use with functions: 


#define 
#define 
#define 
#define 
#define 
#define 


_IOFBF 

_IOLBF 

_IONBF 

SEEK_SET 

SEEK_CUR 

SEEKEND 


0x80 

0x04 


/* setvbuf(): full buffering */ 
/* setvbuf(): line buffering */ 
/* setvbufC): no buffering */ 

/* fseekO: file beginning */ 

/* fseek(): current file pos */ 
/* fseekO: EOF */ 


When a file is opened, the operating system associates it with a stream and keens 
information about the stream in an object of type FILE. A pointer to FILE can be 
thought of as being associated with the file or the stream or both. 
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Opening, Closing, and Conditioning a File 


FILE *fopen(const char ’'‘filename, const char *mode) 


Performs the necessary housekeeping to open a buffered file. A successful call creates a 
stream and returns a pointer to FILE that is associated with the stream. If f i 1 ename 
cannot be accessed, NULL is returned. The basic file modes are "r", "w", and "a", corre¬ 
sponding to read, write, and append, respectively. The file position indicator is set at 
the beginning of the file if the file mode is "r" or "w", and it is set at the end of the file 
if the file mode is "a". If the file mode is "w" or "a" and the file does not exist, it is cre¬ 
ated. An update mode (both reading and writing) is indicated with a +. A binary file is 
indicated with a b. For example, the mode "r+" is used to open a text file for both 
reading and writing. The mode "rb" is used to open a binary file for reading. The mode 
"rb+" or "r+b" is used to open a binary file for reading and writing. Similar conven¬ 
tions apply to "w" and "a." (See Section 11.9, “File Access Permissions,” on page 517.) 
In update mode, input may not be directly followed by output unless the end-of-file 
mark has been reached or an intervening call to one of the file positioning functions 
fseekO, fsetposO, or rewind() has occurred. In a similar fashion, output may not 
be directly followed by input unless an intervening call to f f1 ush () or to one of the file 
positioning functions fseekO, fsetposO, or rewindO has occurred. 


H int fclose(FILE *fp); 

Performs the necessary housekeeping to empty buffers and break all connections to the 
file associated with fp. If the file is successfully closed, zero is returned. If an error 
occurs or the file was already closed, EOF is returned. Open files are a limited resource. 

At most, FOPEN.MAX files can be open simultaneously. System efficiency is improved by 

keeping only needed files open. 


■ int fflush(FILE *fp); 

Any buffered data is delivered. If the call is successful, zero is returned; otherwise, EOF 
is returned. 


■ FILE *freopen(const char *filename, 

const char *mode, FILE *fp); 

Closes the file associated with f p, opens f i 1 ename as specified by mode, and associates 
fp with the new file. If the function call is successful, f p is returned; otherwise, NULL is 
returned. This function is useful for changing the file associated with stdi n, stdout, or 
stderr. 
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a void setbuf(FILE *fp, char *buf); 

If fp is not NULL, the function call setbuf (fp, buf) is equivalent to 
setvbuf(fp, buf, _IOFBF, BUFSIZ) 
except that nothing is returned. If fp is NULL, the mode is _IONBF. 


■ int setvbuf(FILE *fp, char *buf, int mode, size_t n); 

Determines how the file associated with f p is to be buffered. The function must be 
invoked after the file has been opened but before it is accessed. The modes _IOFBF, 
„IOLBF, and _I0NBF cause the file to be fully buffered, line buffered, and unbuffered, 
respectively. If buf is not NULL, the array of size n pointed to by buf is used as a buffer. 
If buf is NULL, the system provides the buffer. A successful call returns zero. Caution: If 

an array of storage class automatic is used as a buffer, the file should be closed before 
the function is exited. 


■ FILE *tmpfile(void); 

Opens a temporary file with mode "wb+" and returns a pointer associated with the file. 
11 the request cannot be honored, NULL is returned. The system removes the file after it 
is closed, or on program exit. 

■ char *tmpnam(char *s); 

Creates a unique temporary name that is typically used as a file name. If s is not NULL, 
the name is stored in s, which must be of size L_tmpnam or larger. If s is NULL, the sys¬ 
tem provides an array of static duration to store the name. Further calls to tmpnamO 
can overwrite this space. In all cases, the base address of the array in which the name is 

stored is returned. Repeated calls to tmpnamO will generate at least TMP_MAX unique 
names. 


Accessing the File Position Indicator 

Functions in this section are used by the programmer to access a file randomly. The tra¬ 
ditional functions for this purpose are fseekQ, ftell (), and rewindQ. ANSI C has 
added fgetposO and fsetpos(). An implementation can design these functions to 
access files that are too large to be handled by the traditional functions. However, early 
versions of many ANSI C compilers have not taken advantage of this opportunity. 



■ int fseek(FILE *fp, long offset, int place); 

Sets the file position indicator for the next input or output operation. The position is 
offset bytes from place. The value of place can be SEEK_SET, SEEK_CUR, or 
SEEK_END, which correspond to the beginning of the file, the current position in the 
file, or the end of the file, respectively. If the function call is successful, the end-of-file 
indicator is cleared and zero is returned. 

■ long ftell(FILE *fp); 

Returns the current value of the file position indicator for the file associated with f p. 
On a binary file, this value is a count of the number of bytes from the beginning of the 
file. For text files on some systems, this value is a “magic cookie.” In any case, by saving 
the value returned, fseek() can be used to reset the file position indicator. An unsuc¬ 
cessful call returns -1 and stores a system-dependent value in errno. 

■ void rewind(FILE *fp); 

Sets the file position indicator to the beginning of the file and clears the end-of-file and 
error indicators. The function call rewi nd(fp) is equivalent to 

(void) fseekCfp, 0L, SEEK_SET) 
except that fseekQ clears only the end-of-file indicator. 

■ int fgetpos(FILE *fp, fpos_t *pos); 

Gets the current value of the file position indicator for the file associated with f p and 
stores it in the object pointed to by pos. The stored value can be used later by fset- 
pos() to reset the file position indicator. A successful call returns zero; otherwise, a 
system-dependent value is stored in errno, and a nonzero value is returned. 

■ int fsetpos(FILE *fp, const fpos_t *pos); 

Sets the file position indicator to the value pointed to by pos. A successful call clears 
the end-of-file indicator and returns zero; otherwise, a system-dependent value is writ¬ 
ten to errno, and a nonzero value is returned. 
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Error Handling 

H void clearerr(FILE *fp); 

Clears the error and end-of-file indicators for the file associated with f p. 
m int feof (FILE *f p) ; 

Returns a nonzero value if the end-of-file indicator has been set for the file associated 
with fp. 

■ int ferror(FILE *fp); 

Returns a nonzero value if the error indicator has been set for the file associated with 
fp. 

■ void perror(const char *s); 

Prints an error message associated with errno on stderr. First, the string s is printed, 
followed by a colon and a space. Then the associated error message is printed, followed 
by a newline. {The function call strerror(errno) prints only the associated error mes¬ 
sage.) 

Character Input/Output 

■ int getc(FILE *fp); 

Equivalent to fgetc(), except that it is implemented as a macro. Since fp may be eval¬ 
uated more than once in the macro definition, a call with an argument that has side- 
effects, such as fgetc(*p++), may not work correctly. 

■ int getchar(void); 

The call getcharO is equivalent to getc(stdi n). 

B char *gets(char *s); 

Reads characters from stdi n and stores them in the array pointed to by s until a new- 
line is read or the end-of-file is reached, whichever occurs first. At this point, any new- 
line is discarded and a null character is written. (In contrast, fgetsQ preserves the 
newline.) If any characters are written, s is returned; otherwise, NULL is returned. 


fl int fgetc(FILE *fp); 

Gets the next character from the file associated with f p and returns the value of the 
character read. If the end-of-file is encountered, the end-of-file indicator is set, and EOF 
is returned. If an error occurs, the error indicator is set, and EOF is returned. 

■ char *fgets(char *line, int n, FILE *fp); 

Reads at most n - 1 characters from the file associated with f p into the array pointed 
to by 1 i ne. As soon as a newline is read into the array or an end-of-file is encountered, 
no additional characters are read from the file. A null character is written into the array 
to end the process. If an end-of-file is encountered right at the start, the contents of 
line are undisturbed, and NULL is returned; otherwise, 1 ine is returned. 

■ int fputc(int c, FILE *fp); 

Converts the argument c to an unsi gned char and writes it in the file associated with 
fp. If the call fputc(c) is successful, it returns 

(int) (unsigned char) c 
Otherwise, it sets the error indicator and returns EOF. 

b int fputs(const char *s, FILE *fp); 

Copies the null-terminated string s into the file associated with f p, except for the ter¬ 
minating null character itself. (The related function puts () appends a newline.) A suc¬ 
cessful call returns a nonnegative value; otherwise, EOF is returned. 

■ int putc(int c, FILE *fp); 

The putc() function is equivalent to fputcQ, except that it is implemented as a 
macro. Since f p may be evaluated more than once in the macro definition, a call with an 
argument that has side-effects, such as putc(*p++), may not work correctly. 

m int putchar(int c); 

The call putchar(c) is equivalent to putc(c, stdout). 
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■ int puts(const char *s); 

Copies the null-terminated string s to the standard output file, except the terminating 
null character itself. Then a newline is written. (The related function fputsO does not 
append a newline.) A successful call returns a nonnegative value; otherwise, EOF is 
returned. 

■ int ungetc(int c, FILE *fp); 

Pushes the value (unsigned char) c back onto the stream associated with fp, pro¬ 
vided the value of c is not EOF. At least one character can be pushed back. (Most sys¬ 
tems allow more.) Pushed back characters will be read from the stream in the reverse 
order in which they were pushed back. Once they have been read, they are forgotten; 
they are not placed permanently in the file. Caution: An intervening call to one of the 
file positioning functions fseek(), fsetposO, or rewindO causes any pushed-back 
characters to be lost. Also, until the pushed-back characters have been read, f tel 1 () 
may be unreliable. 

Formatted Input/Output 

■ int fprintf(FILE *fp, const char *cntrl_string, ...); 

Writes formatted text into the file associated with f p and returns the number of charac¬ 
ters written. If an error occurs, the error indicator is set, and s a negative value is 
returned. Conversion specifications, or formats, can occur in cntrl_stri ng. They 
begin with a % and end with a conversion character. The formats determine how the 
other arguments are printed. (See Section 11.1, “The Output Function printfO,” on 
page 493.) 

■ int printf(const char *cntrl_string, ...); 

A function call of the form pri ntf (cntrl_stri ng, other .arguments) is equivalent to 

fprintf(stdout, cntrl_string, other .arguments') 

■ int sprintf(char *s, const char *cntrl_string, ...); 

This is the string version of pri ntf (). Instead of writing to stdout, it writes to the 
string pointed to by s. 



MS 
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m int vfprintf(FILE *fp, const char *cntrl„stri ng, 

va_list ap); 

int vprintf(const char *cntr1_string, va_Jist ap); 
int vsprintf(char s*, const char *cntrl_string, 

va_J i st ap) ; 

These functions correspond to fpri ntf (), printfO, and spri ntf (), respectively. 
Instead of a variable length argument list, they have a pointer to an array of arguments 
as defined in stdarg.h. 

■ int fscanf(FILE *fp, const char *cntrl_string, ...); 

Reads text from the file stream associated with f p and processes it according to the 
directives in the control string. There are three lands of directives: ordinary characters, 
white space, and conversion specifications. Ordinary characters are matched, and white 
space is matched with optional white space. A conversion specification begins with a % 
and ends with a conversion character; it causes characters to be read from the input 
stream, a corresponding value to be computed, and the value to be placed in memory at 
an address specified by one of the other arguments. If the function is invoked and the 
input stream is empty, EOF is returned; otherwise, the number of successful conver¬ 
sions is returned. (See Section 11.2, “The Input Function scanf (),” on page 499.) 

m int scanf(const char *cntrl_string, ...); 

A function call of the form scanf (cntrl_stri ng, other .arguments ) is equivalent to 

fscanf (stdi n, cntrl_string, other .arguments) 

a int sscanf(const char *s , const char *cntrl_string, ...); 

This is the string version of scanf (). Instead of reading from stdi n, it reads from the 
string pointed to by s. Reading from a string is unlike reading from a file; if we use 
sscanf () to read from s again, then the input starts at the beginning of the string, not 
where we left off before. 
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Direct Input/Output 

The functions fread() and fwriteO are used to read and write binary files, resp< 
tively. No conversions are performed. In certain applications, the use of these functio 
can save considerable time. 

■ size_t fread(void *a_ptr, size_t el_size, size_t n, 

FILE *fp); 

Reads at most n * el_si ze bytes (characters) from the file associated with fp into t 
array pointed to by a_ptr. The number of array elements successfully written 
returned. If an end-of-file is encountered, the end-of-file indicator is set and a sh( 
count is returned. If el_size or n is zero, the input stream is not read, and zero 
returned. 

■ size_t fwrite(const void *a_ptr, size_t el_size, 

size_t n, FILE *fp); 

Reads n * el_size bytes (characters) from the array pointed to by a_ptr and writ 
them to the file associated with f p. The number of array elements successfully writt 
is returned. If an error occurs, a short count is returned. If el_size or n is zero, t 
array is not accessed, and zero is returned. 


Removing or Renaming a File 

a int remove(const char *fi1ename); 

Removes the file with the name filename from the file system. If the call is successf 
zero is returned; otherwise, -1 is returned. (This is the uni i nk() function in traditioi 
C.) 

ffl int rename(const char *from, const char *to); 

Changes the name of a file. The old name is in the string pointed to by from. The n< 
name is in the string pointed to by to. If a file with the new name already exists, wl 
happens is system-dependent; typically, in UNIX the file is overwritten. On most s^ 
terns, the old and new names can be either files or directories. If one of the argumei 
is a directory name, the other one must be too. Zero is returned if the call is successf 
otherwise, -1 is returned, and a svstem-deDendent value is written to errno. 
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A.13 General Utilities: <stdl ib.h> 

This header file contains prototypes of functions for general use, along with related 
macros and type definitions. Here are some examples of the macros and type defini- 

ft Art q « 
llUIiD * 

#include <stddef.h> /* for size_t and wchar_t */ 

#define EXIT_SUCCESS 0 /* for use with exitQ */ 

#define EXIT_FAILURE 1 /* for use with exitQ */ 

#define NULL 0 /* null pointer value */ 

#defi ne RAND_MAX 32767 /* 2M5 - 1 */ 

typedef struct { 

int quot; /* quotient */ 

int rem; /* remainder */ 

} div_t; 

typedef struct { 

long quot; /* quotient */ 

long rem; /* remainder */ 

} 1div_t; 

Dynamic Allocation of Memory 

■ void *calloc(size_t n, size_t el_size); 

Allocates contiguous space in memory for an array of n elements, with each element 
requiring el_size bytes. The space is initialized with all bits set to zero. A successful 
call returns the base address of the allocated space; otherwise, NULL is returned. 

■ void *malloc(size_t size); 

Allocates a block of space in memory consisting of size bytes. The space is not initial¬ 
ized. A successful call returns the base address of the allocated space; otherwise, NULL 
is returned. 

■ void *realloc(void *ptr, size_t size); 

Changes the size of the block pointed to by ptr to si ze bytes. The contents of the 
space will be unchanged up to the lesser of the old and new sizes. Any new space is not 
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initialized. The function attempts to keep the base address of the block the same; if this 
is not possible, it allocates a new block of memory, copying the relevant portion of the 
old block and deallocating it. If ptr is NULL, the effect is the same as calling malloc(). 
If ptr is not NULL, it must be the base address of space previously allocated by a call to 
cal 1 oc(), mal loc(), or real 1 oc() that has not yet been deallocated by a call to 
freeO or realloc(). A successful call returns the base address of the resized (or 
new) space; otherwise, NULL is returned. 

B void free(void *ptr); 

Causes the space in memory pointed to by ptr to be deallocated. If ptr is NULL, the 
function has no effect. If ptr is not NULL, it must be the base address of space previ¬ 
ously allocated by a call to calloc(), malloc(), or realloc() that has not yet been 
deallocated by a call to f ree() or reallocC); otherwise, the call is in error. The effect 
of the error is system-dependent. 

Searching and Sorting 

a void *bsearch(const void *key_ptr, const void *a_ptr, 

size_t n_els, size_t el_size, 

int compare(const void *, const void *)); 

Searches the sorted array pointed to by a_ptr for an element that matches the object 
pointed to by key_ptr. If a match is found, the address of the element is returned; oth¬ 
erwise, NULL is returned. The number of elements in the array is n_el s, and each ele¬ 
ment is stored in memory in el_size bytes. The elements of the array must be in 
ascending sorted order with respect to the comparison function compareQ. The com¬ 
parison function takes two arguments, each one being an address of an element of the 
array. The comparison function returns an int that is less than, equal to, or greater 
than zero, depending on whether the element pointed to by its first argument is consid¬ 
ered to be less than, equal to, or greater than the element pointed to by its second argu¬ 
ment. (The function bsearchQ uses a binary search algorithm.) 

■ void qsort(void *a_ptr, size_t n_els, size_t el_size, 

int compare(const void *, const void *)); 

Sorts the array pointed to by a_ptr in ascending order with respect to the comparison 
function compare(). The number of elements in the array is n_els, and each element is 
stored in memory in el_size bytes. The comparison function takes two arguments, 
each one being an address of an element of the array. The comparison function returns 
an i nt that is less than, equal to, or greater than zero, depending on whether the ele¬ 
ment pointed to by its first argument is considered to be less than, equal to, or greater 
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than the element pointed to by its second argument. (By tradition, the function 
qsortQ implements a “quicker-sort” algorithm.) 


Pseudo Random-Number Generator 

B int rand(void); 

Each call generates an integer and returns it. Repeated calls generate what appears to 
be a randomly distributed sequence of integers in the interval [0, RAND_MAX], 

B void srand(unsigned seed); 

Seeds the random-number generator, causing the sequence generated by repeated calls 
to randQ to start in a different place each time. On program startup, the random-num¬ 
ber generator acts as it srand(l) had been called. The statement 

srand(time(NULL)); 

can be used to seed the random-number generator with a different value each time the 
program is invoked. 


Communicating with the Environment 

fl char *getenv(const char *name); 

Searches a list of environment variables provided by the operating system. If name is 
one of the variables in the list, the base address of its corresponding string value is 

returned; otherwise, NULL is returned. (See Section 11.12, “Environment Variables,” on 
page 521.) 

B int system(const char *s); 

Passes the string s as a command to be executed by the command interpreter (the shell) 
provided by the operating system. If s is not NULL and a connection to the operating 
system exits, the function returns the exit status returned by the command. If s is 

NULL, the function returns a nonzero value if the command interpreter is available via 
this mechanism; otherwise, it returns zero, 
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Integer Arithmetic 

■ int abs(int i); 
long labs(long i); 

Both functions return the absolute value of i. 

■ div_t div(int numer, int denom); 
ldiv_t ldiv(long numer, long denom); 

Both functions divide numer by denom and return a structure that has the quotient and 
remainder as members. The following is an example: 

div_t d; 
d = div(17, 5); 

printf("quotient = %d, remainder = %d\n", d.quot, d.rem); 

When executed, this code prints the line 

quotient = 3, remainder = 2 


String Conversion 

Members of the two families ato...() and strto...() are used to convert a string to a 
value. The conversion is conceptual; it interprets the characters in the string, but the 
string itself does not change. The string can begin with optional white space. The con¬ 
version stops with the first inappropriate character. For example, both of the function 
calls 

strtod("123x456", NULL) and strtod("\n 123 456", NULL) 

return the doubl e value 123.0. The strto...() family provides more control over the 
conversion process and provides for error checking. 

■ double atof(const char *s); /* ascii to floating number */ 

Converts the string s to a doubl e and returns it. Except for error behavior, the function 
call 

atof(s) is equivalent to strtod(s, NULL) 


If no conversion takes place, the function returns zero. 
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■ int atoi(const char *s); /* ascii to integer */ 

Converts the string s to an int and returns it. Except for error behavior, the function 
call 

atoi (s) is equivalent to (int) strtol(s, NULL, 10) 

If no conversion takes place, the function returns zero. 

■ long atoi(const char *s); /* ascii to long */ 

Converts the string s to a long and returns it. Except for error behavior, the function 
call 


atoi (s) is equivalent to strto! (s, NULL, 10) 

If no conversion takes place, the function returns zero. 

■ double strtod(const char *s, char **end_ptr); 

Converts the string s to a doubl e and returns it. If no conversion takes place, zero is 
returned. If end_ptr is not NULL and conversion takes place, the address of the charac¬ 
ter that stops the conversion process is stored in the object pointed to by end_ptr. If 
end_ptr is not NULL and no conversion takes place, the value s is stored in the object 
pointed to by end_ptr. On overflow, either HUGE_VAL or -HUGE_VAL is returned, and 
ERANGE is stored in errno. On underflow, zero is returned, and ERANGE is stored in 
errno. 


■ long strtol(const char *s, char **end_ptr, int base); 

Converts the string s to a 1 ong and returns it. If base has a value from 2 to 36, the dig¬ 
its and letters in s are interpreted in that base. In base 36, the letters a through z and A 
through Z are interpreted as 10 through 35, respectively. With a smaller base, only 
those digits and letters with corresponding values less than the base are interpreted. If 
end_ptr is not NULL and conversion takes place, the address of the character that stops 
the conversion process is stored in the object pointed to by end_ptr. For example, 

char *p; 
long value; 

value = strtol("12345", &p, 3); 

printf("value = %ld, end string = \"%s\"\n", value, p); 
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When executed, this code prints the line 
value = 5, end string = "345" 

Since the base is 3, the character 3 in the string "12345" stops the conversion process. 
Only the first two characters in the string are converted. In base 3, the characters 12 get 
converted to decimal value 5. In a similar fashion, the code 

value = strtol("abcde", &p, 12); 

printf("value = %ld, end string = \"%s\"\n", value, p); 
prints the line 

value = 131, end string = "cde" 

Since the base is 12, the character c in the string "abcde" stops the conversion process. 
Only the first two characters in the string are converted. In base 12, the characters ab 
get converted to decimal value 131. 

If base is zero, s is interpreted as either a hexadecimal, octal, or decimal integer, 
depending on the leading nonwhite characters in s. With an optional sign and 0x or 0X, 
the string is interpreted as a hexadecimal integer (base 16). With an optional sign and 0, 
but not 0x or 0X, the string is interpreted as an octal integer (base 8). Otherwise, it is 
interpreted as a decimal integer. 

If no conversion takes place, zero is returned. If end_ptr is not NULL and no conver¬ 
sion takes place, the value s is stored in the object pointed to by end_.pt r. 011 (j^ c i flo, 
either LONGJ4AX or -L0NG_MAX is returned, and ERANGE is stored in errno. 

n unsigned long strtoul(const char *s, 

char **end_ptr, int base); 

The strtoul () function is similar to strtol (), but returns an unsigned long. On 
overflow, either ULONG_MAX or -ULONG_MAX is returned. 


Multibyte Character Functions 

Multibyte characters are used to represent members of an extended character set. How 7 
the members of an extended character set are defined is locale-dependent. 

■ int inblen(const char *s, size_t n) ; 

If s is NULL, the function returns a nonzero or zero value, depending on whether multi¬ 
byte characters do or do not have a state-dependent encoding. If s is not NULL, the 



function examines at most n characters in s and returns the number of bytes that com¬ 
prise the next multibyte character. If s points to the null character, zero is returned. If s 
does not point to a multibyte character, the value -1 is returned. 

■ int mbtowc(wchar_t *p, const char *s, size_t n); 

Acts the same as mblenQ, but with the following additional capability: If p is not NULL, 
the function converts the next multibyte character in s to its corresponding wide char¬ 
acter type and stores it in the object pointed to by p. 

m int wctomb(char *s, wchar_t wc); 

If s is NULL, the function returns a nonzero or zero value, depending on whether multi- 
byte characters do or do not have a state-dependent encoding. If s is not NULL and wc is 
a wide character corresponding to a multibyte character, the function stores the multi¬ 
byte character in s and returns the number of bytes required to represent it. If s is not 
NULL and wc does not correspond to a multibyte character, the value -1 is returned. 


Multi byte String Functions 

m size_t mbstowcs(wchar_t *wcs, const char *mbs, size_t n); 

Reads the multibyte string pointed to by mbs and writes the corresponding wide charac¬ 
ter string into wcs. At most, n wide characters are written, followed by a wide null char¬ 
acter. If the conversion is successful, the number of wide characters written is returned, 
not counting the final wide null character; otherwise, -1 is returned. 

h int wcstombs(char *mbs, const wchar_t *wcs, size_t n); 

Reads the wide character string pointed to by wcs and writes the corresponding multi¬ 
byte string into mbs. The conversion process stops after n wide characters have been 
written or a null character is written, whichever comes first. If the conversion is suc¬ 
cessful, the number of characters written is returned, not counting the null character (if 
any); otherwise, -1 is returned. 
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Leaving the Program 

B void abort(void); 

Causes abnormal program termination, unless a signal handler catches SIGABRT and 
does not return. It depends on the implementation whether any open files are properly 
closed and any temporary files are removed. 

a int atexit(void (*func)(void)); 

Registers the function pointed to by func for execution upon normal program exit, A 
successful call returns zero; otherwise, a nonzero value is returned. At least 32 such 
functions can be registered. Execution of registered functions occurs in the reverse 
order of registration. Only global variables are available to these functions. 

fl void exit(int status); 

Causes normal program termination. The functions registered by atexi t () are invoked 
in the reverse order in which they were registered, buffered streams are flushed, files 
are closed, and temporary files that were created by tmpfi 1 e() are removed. The value 
status, along with control, is returned to the host environment. If the value of status 
is zero or EXIT_SUCCESS, the host environment assumes that the program executed 
successfully; if the value is EXIT_FAILURE, it assumes that the program did not execute 
successfully. The host environment may recognize other values for status. 


A.14 Memory and String Handling: <string.h> 

This header file contains prototypes of functions in two families. The memory func¬ 
tions mem. . .() are used to manipulate blocks of memory of a specified size. These 
blocks can be thought of as arrays of bytes (characters). They are like strings, except 
that they are not null-terminated. The functions str. . .() are used to manipulate null- 
terminated strings. Typically, the following line is at the top of the header file: 
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Memory-Handling Functions 

B void *memchr(const void *p, int c, size_t n); 

Starting in memory at the address p, a search is made for the first unsigned character 
(byte) that matches the value (unsi gned char) c. At most, n bytes are searched. If suc¬ 
cessful, a pointer to the character is returned; otherwise, NULL is returned. 

B int memcmpfconst void *p, const void *q, size_t n); 

Compares two blocks in memory of size n. The bytes are treated as unsigned charac¬ 
ters. The function returns a value that is less than, equal to, or greater than zero, 
depending on whether the block pointed to by p is lexicographically less than, equal to, 
or greater than the block pointed to by q. 

B void *memcpy(void *to, void *from, size_t n); 

Copies the block of n bytes pointed to by from to the block pointed to by to. The value 
to is returned. If the blocks overlap, the behavior is undefined. 

B void *memmove(void *to, void *from, size_t n); 

Copies the block of n bytes pointed to by from to the block pointed to by to. The value 
to is returned. If the blocks overlap, each byte in the block pointed to by from is 
accessed before a new value is written in that byte. Thus, a correct copy is made, even 
when the blocks overlap. 

B void *memset(void *p, int c, size_t n); 

Sets each byte in the block of size n pointed to by p to value (unsigned char)c. The 
value p is returned. 


String-Handling Functions 

B char *strcat(char *sl, const char *s2); 

Concatenates the strings si and s2. That is, a copy of s2 is appended to the end of si. 
The programmer must ensure that si points to enough space to hold the result. The 
string si is returned. 


#include <stddef.h> 


/* for NULL and size_t */ 
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m char *strchr(const char *s, int c); 

Searches for the first character in s that matches the value (char) c. If the character is 
found, its address is returned; otherwise, NULL is returned. The call strchr(s, '\0') 
returns a pointer to the terminating null character in s. 

■ int strcmp(const char *sl, const char *s2); 

Compares the two strings si and s2 lexicographically. The elements of the strings are 
treated as unsigned characters. The function returns a value that is less than, equal to, 

or greater than zero, depending on whether si is lexicographically less than, equal to, 
or greater than s2. 

B int strcoll(const char *sl, const char *s2); 

Compares the two strings si and s2 using a comparison rule that depends on the cur¬ 
rent locale. The function returns a value that is less than, equal to, or greater than zero, 
depending on whether si is considered less than, equal to, or greater than s2. 

■ char *strcpy(char *sl, const char *s2); 

Copies the string s2 into the string si, including the terminating null character. What¬ 
ever exists in si is overwritten. The programmer must ensure that si points to enough 
space to hold the result. The value si is returned. 

■ size_t strcspn(const char *sl, const char *s2); 

Computes the length of the maximal initial substring in si consisting entirely of char¬ 
acters not in s2. For example, the function call 

strcspn("April is the cruelest month", "abc") 

returns the value 13, because "Apri 1 i s the " is the maximal initial substring of the 
first argument having no characters in common with "abc". (The character c in the 
name strcspn stands for “complement,” and the letters spn stand for “span.”) 

^ char *strerror(int error_number); 

Returns a pointer to an error string provided by the system. The contents of the string 
must not be changed by the program. If an error causes the system to write a value in 
errno, the programmer can invoke strerror(errno) to print the associated error 
message. (The related function perrorQ can also be used to print the error message.) 
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■ size_t strlen(const char *s); 

Returns the length of the string s. The length is the number of characters in the string, 
not counting the terminating null character. 

■ char *strncat(char *sl, const char *s2, size_t n); 

At most, n characters in s2, not counting the null character, are appended to si. Then a 
null character is written in si. The programmer must ensure that si points to enough 
space to hold the result. The string si is returned. 

■ int strncmp(const char *sl, const char *s2, size_t n); 

Compares at most n characters lexicographically in each of the two strings si and s2. 
The comparison stops with the nth character or a terminating null character, whichever 
comes first. The elements of the strings are treated as unsigned characters. The func¬ 
tion returns a value that is less than, equal to, or greater than zero, depending on 
whether the compared portion of si is lexicographically less than, equal to, or greater 
than the compared portion of s2. 

■ char *strncpy(char *sl, const char *s2, size_t n); 

Precisely n characters are written into si, overwriting whatever is there. The characters 
are taken from s2 until n of them have been copied or a null character has been copied, 
whichever comes first. Any remaining characters in si are assigned the value ' \0'. If 
the length of s2 is n or larger, si will not be null-terminated. The programmer must 
ensure that si points to enough space to hold the result. The value si is returned. 

■ char *strpbrk(const char *sl, const char *s2); 

Searches for the first character in si that matches any one of the characters in s2. If the 
search is successful, the address of the character found in si is returned; otherwise, 
NULL is returned. For example, the function call 

strpbrk("Apri1 is the cruelest month", "abc") 

returns the address of c in cruel est. (The letters pbrk in the name strpbrk stand for 
“pointer to break.”) 

■ char *strrchr(const char *s, int c); 

Searches from the right for the first character in s that matches the value (char) c. If 
the character is found, its address is returned; otherwise, NULL is returned. The call 
strchr(s, '\0') returns a pointer to the terminating null character in s. 
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■ size_t strspn(const char *sl, const char ,v s2) ; 

Computes the length of the maximal initial substring in si consisting entirely of char¬ 
acters in s2. For example, the function call 

strspn("Apri1 is the cruelest month", "A is for apple") 

returns the value 9, because all the characters in the first argument preceding the t in 
the occur in the second argument, but the letter t does not. (The letters spn in the 
name strspn stand for “span.”) 

■ char *strstr(const char *sl, const char *s2); 

Searches in si for the first occurrence of the substring s2. If the search is successful, a 

pointer to the base address of the substring in si is returned; otherwise, NULL is 
returned, 

■ char *strtok(char *sl, const char *s2); 

Searches for tokens in si, using the characters in s2 as token separators. If si contains 
one or more tokens, the first token in si is found, the character immediately following 
the token is overwritten with a null character, the remainder of si is stored elsewhere 
by the system, and the address of the first character in the token is returned. Subse¬ 
quent calls with si equal to NULL return the base address of a string supplied by the 
system that contains the next token. If no additional tokens are available, NULL is 
returned. The initial call strtok(sl, s2) returns NULL if si contains no tokens. The 
following is an example: 

char sl[] = " this is,an example ; 
char s2[] 
char *p; 

printf("\"%s\"", strtok(sl, s2)); 
while ((p = strtok(NULL, s2)) != NULL) 
printfC" \"%s\"", p); 
putcharf'\n'); 


When executed, this code prints the line 
"this" "is" "an" "example" 
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■ size_t strxfrin(char *sl, const char *s2, size_t n); 

Transforms the string s2 and places the result in si, overwriting whatever is there. At 
most, n characters, including a terminating null character, are written in si. The length 
of si is returned. The transformation is such that when two transformed strings are 
used as arguments to strcmpO, the value returned is less than, equal to, or greater 
than zero, depending on whether strcoll () applied to the untransformed strings 
returns a value less than, equal to, or greater than zero. (The letters xf rm in the name 
strxf rm stand for “transform.”) 




A.15 Date and Time: <time.h> 

This header file contains prototypes of functions that deal with date, time, and the 
internal clock. Here are examples of some macros and type definitions: 

finclude <stddef.h> /* for NULL and size_t */ 

#define CLOCKS_PER_SEC 60 /* machine-dependent */ 

typedef long clock_t; 

typedef long time_t; 

Objects of type struct tm are used to store the date and time, 

struct tm { 

int tm_sec; /* seconds after the minute: [0, 60] */ 


int tm_min; /* minutes after the hour: [0, 59] *{ 
int tm_hour; /* hours since midnight: [0, 23] */ 
int tm_mday; /* day of the month: [1, 31] */ 
int tm_mon; /* months since January: [0, 11] */ 
int tm_year; /* years since 1900 */ 
int tm_wday; /* days since Sunday: [0, 6] */ 
int tm_yday; /* days since 1 January: [0, 365] */ 
int tm_isdst; /* Daylight Savings Time flag */ 

}; 


Note that the range of values for tm_sec has to accommodate a “leap second,” which 
occurs only sporadically. The flag tm_i sdst is positive if Daylight Savings Time is in 
effect, zero if it is not, and negative if the information is not available. 
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Accessing the Clock 


On most systems, the clockQ function provides access to the underlying machine 
clock. The rate at which the clock runs is machine-dependent. 

■ clock_t clock(void); 

Returns an approximation to the number of CPU “clock ticks” used by the program up 
to the point of invocation. To convert it to seconds, the value returned can be divided 
by CLOCKS_PER_SEC, If the CPU clock is not available, the value -1 is returned, (See Sec¬ 
tion 11.16, “How to Time C Code,” on page 52S.) 


Accessing the Time 


In ANSI C, time comes in two principal versions: a “calendar time” expressed as an inte¬ 
ger, which on most systems represents the number of seconds that have elapsed since 
1 January 1970, and a “broken-down time” expressed as a structure of type struct tm. 
l’he calendar time is encoded with respect to Universal Time Coordinated (UTC). The 
programmer can use library functions to convert one version of time to the other. Also, 
functions are available to print the time as a string. 

■ time_t time(time_t *tp); 

Returns the current calendar time, expressed as the number of seconds that have 
elapsed since 1 January 1970 (UTC). Other units and other starting dates are possible, 
but these ate the ones typically used. If tp is not NULL, the value also is stored in the 
object pointed to by tp. Consider the following code: 


time_t now; 
now = time(NULL); 

printf("\n%s%ld\n%s%s%s%s\n", 

II _ now = ", now, 

|| , ctimeC&now) = ", ctimeC&now), 

"asctime(localtime(&now)) = ”,asctime(localtime(&now))); 

When executed on our system, this code printed these lines: 

now = 685136007 

ctimeC&now) = Tue Sep 17 12:33:27 1991 
asctime(1ocaltime(&now)) = Tue Sep 17 12:33:27 1991 



m char *asctime(const struct tm *tp); 

Converts the broken-down time pointed to by tp to a string provided by the system. 
The function returns the base address of the string. Later calls to asctimeQ and 
ctimeO overwrite the string. 

■ char *ctime(const time_t *t_ptr); 

Converts the calendar time pointed to by t_ptr to a string provided by the system. The 
function returns the base address of the string. Later calls to asctimeO and ctime() 
overwrite the string. The two function calls 

ctime(&now) and asctime(local time(&now)) 

are equivalent. 

■ double difftime(time_t t0, time_t tl); 

Computes the difference tl - t0 and, if necessary, converts this value to the number 
of seconds that have elapsed between the calendar times t0 and tl. The value is 
returned as a double. 

B struct tm *gmtime(const time_t *t_ptr); 

Converts the calendar time pointed to by t_ptr to a broken-down time and stores it in 
an object of type struct tm that is provided by the system. The address of the struc¬ 
ture is returned. The function computes the broken-down time with respect to Univer¬ 
sal Time Coordinated (UTC). This used to be called Greenwich Mean Time (GMT); hence, 
the name of the function. Later calls to gmtime() and localtinieO overwrite the 
structure. 

■ struct tm *1ocaltime(const time_t *t_ptr); 

Converts the calendar time pointed to by t_ptr to a broken-down local time and stores 
it in an object of type struct tm that is provided by the system. The address of the 
structure is returned. Later calls to gmtimeQ and localtimeQ overwrite the struc¬ 
ture. 

■ time_t mktime(struct tm *tp); 

Converts the broken-down local time in the structure pointed to by tp to the corre¬ 
sponding calendar time. If the call is successful, the calendar time is returned; other¬ 
wise, -1 is returned. For the purpose of the computation, the tm_wday and tm_yday 
members of the structure are disregarded. Before the computation, other members can 
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have values outside their usual range. After the computation, the members of the struc¬ 
ture may be overwritten with an equivalent set of values in which each member lies 


within its normal range. The values for tm_wday and tm_yday are computed from those 

for the other members, for example, the following code can be used to find the date 
1,000 days from now: 


struct tm *tp; 
time_t now, later; 

now = time(MULL); 
tp = localtime(&now); 
tp -> tm_mday += 1000; 
later = mktime(tp); 

printf("\nl000 days from now: %s\n", ctime(&later)); 

■ size_t strftime(char *s, size_t n, 

const char *cntrl_str, const struct tm *tp); 

Writes characters into the string pointed to by s under the direction of the control 
string pointed to by cntrl_str. At most, n characters are written, including the null 
character. If more than n characters are required, the function returns zero and the 
contents of s are indeterminate; otherwise, the length of s is returned. The control 
string consists of ordinary characters and conversion specifications, or formats, that 
determine how values from the broken-down time in the structure pointed to by tp are 

to be written. Each conversion specification consists of a % followed by a conversion 
character. 



Using strftimeQ 

Conversion 

specification 

What is printed 

Example 

%a 

abbreviated weekday name 

Fri 

%k 

full weekday name 

Friday 

%b 

abbreviated month name 

Sep 

%B 

full month name 

September 

%c 

date and time 

Sep 01 02:17:23 1993 

%d 

day of the month 

01 

%H 

hour of the 24-hour day 

02 

%h 

hour of the 12-hour day 

02 

%j 

day of the year 

243 

%m 

month of the year 

9 

m 

minutes after the hour 

-j 

%p 

AM or PM 

AM 

%S 

seconds after the hour 

23 

%u 

week of the year (Sun-Sat) 

34 

%W 

day of the week (0-6) 

5 

%x 

date 

Sep 01 1993 

%X 

time 

02:17:23 

%y 

year of the century 

93 

%Y 

year 

1993 

0/7 

time zone 

PDT 

0 / 0 / 

7 o 7 o 

percent character 

% 


Consider the following code: 

char s[100]; 
time_t now; 

now = time(NULL); 

strftime(s, 100, on %A, %d %B %Y", 

localtime(&now)); 
printf("%s\n\n", s); 
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"When we executed a program containing these lines, the following line was printed: 
13:01:15 on Tuesday, 17 September 1991 


A.l 6 Miscellaneous 


In addition to the functions specified by ANSI C, the system may provide other func¬ 
tions in the library. In this section, we describe the non-ANSI C functions that are widely 
available. Some functions, such as exec! (), are common to most systems. Other func¬ 
tions, such as fork() or spawn ! (), are generally available in one operating system but 
not another. The name of the associated header file is system-dependent. 


File Access 


■ int access(const char *path, int amode); 

Checks the file with the name path for accessibility according to the bit pattern con¬ 
tained in the access mode amode. The function prototype is in unistd.h on UNIX systems 

and in io.h on MS-DOS systems. The following symbolic constants are defined in the 
header file: 


F_0K 

Check for existence. 

R_0I< 

Test for read permission. 

W_0K 

Test for write permission. 

X_0l< 

Test for execute or search permission. 


' * / ' ™ — — ^ v. v, u u iiiuuv 

stants. For example, the function call 


access(path, R_0K | W-OK) 

could be used to check whether the file permits both read and write access to the file. 

The function returns 0 if (he requested access is permitted; otherwise, -1 is returned 
and errno is set to indicate the error. 



Using File Descriptors 

■ int open(const char *filename, int flag, ...); 

Opens the named file for reading and/or writing as specified by the information stored 
bitwise in f 1 ag. If a file is being created, a third argument of type unsi gned is needed; 
it sets the file permissions for the new file. If the call is successful, a nonnegative inte¬ 
ger called the file descriptor is returned; otherwise, errno is set and -1 is returned. Val¬ 
ues that can be used for f 1 ag are given in the header file that contains the prototype 
for openQ, These values are system-dependent. 

■ int close(int fd); 

Closes the file associated with the file descriptor f d. If the call is successful, zero is 
returned; otherwise, errno is set and -1 is returned. 

■ int read(int fd, char *buf, int n); 

Reads at most n bytes from the file associated with the file descriptor fd into the object 
pointed to by buf. If the call is successful, the number of bytes written in buf is 
returned; otherwise, errno is set and -I is returned. A short count is returned if the 
end-of-file is encountered. 

■ int write(int fd, const char *buf, int n); 

Writes at most n bytes from the object pointed to by buf into the file associated with 
the file descriptor fd. If the call is successful, the number of bytes written in the file is 
returned; otherwise, errno is set and -1 is returned. A short count can indicate that the 
disk is full. 


Creating a Concurrent Process 

■ int fork(void); 

Copies the current process and begins executing it concurrently. The child process has 
its own process identification number. When fork() is called, it returns zero to the 
child and the child’s process ID to the parent. If the call fails, errno is set, and -1 is 
returned. This function is not available in MS-DOS. 
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h int vfork(void); 

Spawns a new process in a virtual memory efficient way. The child process has its own 
process identification number. The address space of the parent process is not fully cop¬ 
ied, which is very inefficient in a paged environment. The child borrows the parent's 
memory and thread of control until a call to exec. . .() occurs or the child exits. The 
parent process is suspended while the child is using its resources. When vforkQ is 
called, it returns zero to the child and the child’s process ID to the parent. If the call 
laris, errno is set, and -1 is returned. This function is not available in MS-DOS. 


Overlaying a Process 


In this section, we describe the two families exec... () and spawn...(). The first is gen¬ 
erally available on both MS-DOS and UNIX systems, the second only on MS-DOS sys¬ 
tems. On UNIX systems, fork() can be used with exec. ..() to achieve the effect of 
spawn.. .(). 

■ int execl(char "name, char *arg0, char *argN); 

int execle(char *name, char *arg0, ..., char *argN, 

char **envp); 

int execlp(char “name, char *arg0, ..., char *argN); 
int execlpe(char *name, char *arg0, char *argN, 

char **envp); 

int execvCchar *name, char **argv); 

int execveCchar *name, char **argv, char **envp); 

int execvp(char *name, char **argv); 

int execvpe(char “name, char **argv, char **envp); 

These functions overlay the current process with the named program. There is no 
return to the parent process. By default, the child process inherits the environment of 
the parent, Members of the family with names that begin with execl require a list of 
arguments that are taken as the command line arguments for the child process. The 
last argument in the list must be the NULL pointer. Members of the family with names 
that begin with exeev use the array argv to supply command line arguments to the 
child process. The last element of argv must have the value NULL. Members of the fam¬ 
ily with names ending in e use the array envp to supply environment variables to the 
child piocess. lhe last element of envp must have the value NULL. Members of the fam¬ 
ily with p in their name use the path variable specified in the environment to determine 
which directories to search for the program. 

® int spawn!(int mode, char *name, char *arg0, 

char *argN); 
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This family of functions corresponds to the exec...() family, except that each member 
has an initial integer argument. The values for mode are 0, 1, and 2. The value 0 causes 
the parent process to wait for the child process to finish before continuing. With value 
1, the parent and child processes should execute concurrently, except that this has not 
been implemented yet. The use of this value will cause an error. The value 2 causes the 
child process to overlay the parent process. 


Interprocess Communication 

■ int pipe (int pd[2]) ; 

Creates an input/output mechanism called a pipe, and puts the associated file descrip¬ 
tors (pipe descriptors) in the array pd. If the call is successful, zero is returned; other¬ 
wise, errno is set and -1 is returned. After a pipe has been created, the system assumes 
that two or more cooperating processes created by subsequent calls to fork() will use 
readO and write() to pass data through the pipe. One descriptor, pd[0], is read 
from; the other, pd [1], is written to. The pipe capacity is system-dependent, but is at 
least 4,096 bytes. If a write fills the pipe, it blocks until data is read out of it. As with 
other file descriptors, closeO can be used to explicitly close pd[0] and pd[l]. This 
function is not available in MS-DOS. 


Suspending Program Execution ^ 

m void sleep(unsigned seconds); 

Suspends the current process from execution for the number of seconds requested. 
The time is only approximate. 





















Appendix B 

Language Syntax 



In this appendix, we give an extended BNF syntax for the ANSI version of the C lan¬ 
guage. (See Section 2.2, “Syntax Rules,” on page 73.) This syntax, although intended for 
the human reader, is concisely written. The C language is inherently context-sensitive; 
restrictions and special cases are left to the main text. The conceptual output of the 
preprocessor is called a translation unit. The syntax of the C language pertains to trans¬ 
lation units. The syntax for preprocessing directives is independent of the rest of the C 
language. We present it at the end of this appendix. 


B.l Program 

program {file } 1+ 

file ::= decls_and_fct_definitions 

decls_ amL fct_defin it ions { declaration } 1+ decls_and_fct_definitions 0 p t 

| { function_definition } 1+ decls_and~fct_definitions m t 





















Appendix B 

Language Syntax 



In this appendix, we give an extended BNF syntax for the ANSI version of the C lan¬ 
guage. (See Section 2.2, “Syntax Rules,” on page 73.) This syntax, although intended for 
the human reader, is concisely written. The C language is inherently context-sensitive; 
restrictions and special cases are left to the main text. The conceptual output of the 
preprocessor is called a translation unit. The syntax of the C language pertains to trans¬ 
lation units. The syntax for preprocessing directives is independent of the rest of the C 
language. We present it at the end of this appendix. 


.1 Program 

program ::= {file } 1+ 

file ::= decls_and_fct_definitions 

::= { declaration } 1+ decls_and_fct_definitions opt 

| { function_definition }i + decls_and_fct_definitions opt 


decls~and_fct_definitions 
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B.2 Function Definition 


function_definition 


{extern | static } opt type .specifier 
function.name ( parameter.declaration.list opy ) 
compound.statement 


function_name ::= identifier 

parameter.declarationjist ::= parameter. deciaration { , parameter.declaration } 0 


B.B Declaration 


declaration 


declaration.specifiers init.declaratorJist opt 


declaration.specifiers :: 


storage.dass.specifier.or.typedef declaration_specifiers opt 
I type_specifier decia ra tion. sped fie rs op t 
I type.qualifier declaration_specifiers opt 


storage.class.specifier.or.typedef ::= auto ] extern | register | static 

I typedef 


type .specifier ::= char | double | float I int | long | short | signed 

| unsigned 

| void | enum.specifier | struct_or_union.specifier 
| typedef.name 


enum.specifier ::= enum tag 0pt { enumeratorJist } i enutn tag 


tag 


identifier 


enumerator list 


enumerator { , enumerator } opt 


enumerator ::= enumeration.constant { = const_mtegral_expr} opl 


enumeration constant 


identifier 
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struct.or.union.specifier struct.or.union tag opl { struct.declaration.hst } 

| struct_or.union tag 

struct.or.union ::= struct I union 
struct.declarationjist ::= ( struct.declaration h+ 

stmct.iedaratton O' pespecifler^ualifierjist structuralorjtst ; 

rrifirst' nunlifier list "= type specifier type.specifier.qualifier.Ust opt 
type.specifier.qualifier.list - type.spedfier.qualifierjist opt 

struct.declarator Jist ::= struct.declarator { , struct.declarator } 0+ 

struct.declarator declarator | declarator opt : constJntegraLexpr 

type.qualifier ::= const | volatile 

declarator ::= pointer opt direcLdeclarator 

pointer ::= t* I type.qualifierJist ovt } 1+ 

type.qualifier.list ::= I type.qualifier }i+ 

direct declarator ,identifier | ( dedaiator') 

a ~ | direct.declarator [ constJntegral.expr opt J 

| direct.declarator ( parameter.type Jist ) 

| direct.declarator ( identifier _list opt ) 

parameter, type.list ::= parameter Jist \ parameter.Jist . 

parameter Jist ::= parameter.declaration { , parameter.dedaration } 0+ 

parameter.declaration declaration.specifiers dec ^ a [ or 

y | declaration^speafiers abstracLaeciaraior op t 

abstract.declarator ::= pointer I pointer opt direct.abstract.dedarator 
jivart abstract declarator v.— ( abstract^dcclarator ) 

du ect.abstract.aeciar abstmcLdedara tor opi l const_integralexpr opt } 

| direct.abstract.declarator 0 pt C parameter.type.Jist opt ) 

identifier Jist ::= identifier { , identifier } 0+ 
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switch_block ::= {{ declarationJist } opt case_defaulLgroup } 

case^defaulL group ::= { case_group } 1+ 

| { case_group } 0+ defaulLgroup { case_ group } 0+ 

case_group { case constJntegraLexpr : }x+ { statement } 1+ 

defaulLgroup ::= default : { statement } 1+ 


B.5 Expression 

expression ::= constant | string Jiteral | ( expression ) | lvalue 

| assignmenLexpression \ expression , expression | + expression 
| - expression | f'unction_expression | relationaLexpression 
| equality.expression | logicaLexpression 
| expression arithmetic_op expression | bitwise_expression 
| expression ? expression : expression \ si zeof expression 
| si zeof ( type_name ) | ( type_name ) expression 

lvalue ::= & lvalue | ++lvalue \ lvalue ++ | --lvalue \ lvalue -- 

| identifier | * expression | lvalue [ expression ] | ( lvalue ) 

| lvalue . identifier | lvalue -> identifier 

assignmenLexpression ::= lvalue assignment_op expression 

assignment_op ;;= = I += I -= I *= I /= I %= I &= I A = I 1= I »= I «= 

arithmetic_op + I - I * I / I % 

relationaLexpression ::= expression < expression | expression > expression 

I expression <= expression | expression >= expression 

equality ^expression ::= expression == expression | expression ! = expression 

logicaLexpression ! expression \ expression \ \ expression 

| expression && expression 
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hitwise.expression :: 


expression | a expression 

expression & expression | expression \ expression 
expression « expression \ expression » expression 


function-expression function.namef argumentJist opt ) 

| ( * pointer ) ( argumentJist opt ) 

argumentJist ::= expression { , expression } 0+ 


..pm 

- 'i - 


type.name 


expression { , expression } 0+ 
type-specifier declarator^ 





B.6 Constant 

constant ::= character.constant | enumeration .constant | floating .constant 

I integer.constant 


character.constant ::= 


' c' | L ’ c 


» ^ l 


c ::= any character from the source character set except ' or \ or newline 
I escape.sequence 


V I V I \? I \\ I \a | \b | \f | \n | \r 
I \ octal.digit octal_digit opt octal.digit opt 
I \x hexadecimal .digit { hexadecimal.digit } 0+ 


escape.sequence 


enumeration-constant ::= identifier 

string.Iiteral ::= "{character.constant}^" | L string.literal 

floating .constant ::= fractional.constant exponential_part opt floating.suffix,^ 

digit.sequence exponential.part floating.suffix opt 

* 

fractional.constant ::= digit.sequence opX . digit .sequence | digit.sequence . 
dig it.sequence ::= { digit } 1+ 


digit :: 


0I1I2I3I4I5I6I7I8I9 
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exponentiaLpart { e I E h { + I - } op t digit.sequence 

floating.suffix f I F | 1 I L 

integer.constant ::= decimal.constant integer.suffix opt 

| octal.const ant integer.suffix QpX 
| hexadecimaLconstant integer.suffix opX 

decimaLconstant ::= 0 I nonzero.digit digit.sequence 

nonzero.digit ::= 1|2|3|4|5|6|7|8|9 

octal.constant ::= 0 { octaLdigit }o+ 

octal.digit ::= 0|1|2|3|4|5|6|7 

hexadecimaLconstant { 0x ( 0X}i { hexadecimal-digit } 1+ 

hexadecimal-digit ::= 0 I 1 I 2 I 3 I 4 I l | 6 | 7 1 ® j , P 

a|b|c|d|e|f|A|B|C|D|E|F 

integer .suffix ::= unsigned_suffix long.suffix opt \ long.suffix unsig ned.su ffix opt 

unsigned.suffix ::= u | U 
long.suffix ::= 1 I L 


B.7 String Literal 

string.literal :;= " s.char .sequence ” I L" s.char.sequence " 

s.char.sequence ::= {5c} 1 + 

sc ::= any character from the source character set except " or \ or newline 
| escape.sequence 
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.o preprocessor 


preprocessing_directive ::= controLline newline \ if_section | ppjoken newline 


controlJine ::= # include {< identifier > \ "identifier"} 

I # undef identifier | # line ppjoken | # error pp_token 
| # pragma ppjoken 

| # define identifier { ( identifierJtst ) } opt { ppjoken } 0+ 


pp_token ::= identifier | constant | stringjiteral \ operator | punctuator 

I pp_token ## pp_token | # identifier 



if.section ::= ifigroup { elif.group }q + { else.group } opt endjfjine 


if-group :: 


# if constJntegraLexpr newline preprocessing_directive opt 
| # i f def identifier newline { preprocessing .directive } opt 
I # i f ndef identifier newline { preprocessing.directive } opt 


elif.group ::= # el if constant.expression newline { preprocessing.directive } op | 


else, group 


# else newline { preprocessing.directive } opt 


endjfjine # endif newline 


newline 


the newline character 




* 







Cl Types 

■ The keyword si gned has been added to the language. 

^ Three types of characters are specified: plain char, signed char, and 
unsigned char. An implementation may represent a plain char as either a 
signed char or an unsigned char. 

ib The keyword si gned can be used in declarations of any of the signed integral types 
and in casts. Except with char, its use is always optional. 

■ In traditional C, the type 1 ong float is equivalent to double. Since long float 
was rarely used, it has been removed from ANSI C. 

■ The type 1 ong doubl e has been added to ANSI C. Constants of tins type are speci¬ 
fied with the suffix L. A 1 ong doubl e may provide more precision and range than a 
doubl e, but it is not required to do so. 

■ The keyword void is used to indicate that a function takes no arguments or t 
returns no value. 
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■ The type voi d * is used for generic pointers. For example, the function prototvne 
for mal 1 oc () is given by 

void *mal1oc(size_t size); 

A generic pointer can be assigned a pointer value of any type, and a variable of any; 
pointer type can be assigned a generic pointer value. Casts are not needed. In con¬ 
trast, the generic pointer type in traditional C is char *. Here, casts are necessary^! 

■ Enumeration types are supported. An example is 

enum day {sun, mon, tue, wed, thu, fri, sat}; 

The enumerators in this example are sun, mon,..., sat. Enumerators are constants 
of type i nt. Thus, they can be used in case labels in swi tch statements. 


C.2 Constants 

■ String constants separated by white space are concatenated. Thus, 

"abc" 

"def" "ghi" is equivalent to "abcdefghi" 

■ String constants are not modifiable. (Not all compilers enforce this.) 

a The type of a numeric constant can be specified by letter suffixes. Some examples 
are 

12 3L /* 

123U /* 

123LIL /* 

1.23F /* 

1.23L /* 

Suffixes may be lower- or uppercase. A numeric constant without a suffix is a type 
big enough to contain the value. 

s The digits 8 and 9 are no longer considered octal digits. They may not be used in an 
octal constant. 




long V 
unsigned */ 
unsigned long */ 
float */ 
long double */ 
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■ Hexadecimal escape sequences beginning with \x have been introduced. As with 
octal escape sequences beginning with \0, they are used in character and string 
constants. 


C.3 Declarations 

■ The type qualifier const has been added. It means that variables so declared are 
not modifiable. (Compilers do not always enforce this.) 

■ The type qualifier vol ati 1 e has been added. It means that variables so declared 
are modifiable by an agent external to the program. For example, some systems put 
the declaration 

extern volatile int errno; 
in the header file errno.h. 


C.4 Initializations 


■ In ANSI C, automatic aggregates such as arrays and structures can be initialized. In 
traditional C, they must be external or of storage class stati c. 


Unions can be initialized. An initialization refers to the union’s first member. 


■ Character arrays of size n can be initialized using a string constant of exactly n 
characters. An example is 

char today[3] = "Fri"; 

The end-of-string sentinel \0 in "Fri" is not copied into today. 
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C.6 T Functions 697 


C.5 Expressions 

a For reasons of symmetry, a unary plus operator has been added to the language. 

B In traditional C, expressions involving one of the commutative binary operators 
such as + or * can be reordered at the convenience of the compiler, even though 
they have been parenthesized in the program. For example, in the statement 


x 


(a + b) + c; 


the variables can be summed by the compiler in some unspecified order. In ANSI C, 
this is not true. The parentheses must be honored. 

A pointer to a function can be dereferenced either explicitly or implicitly. If, for 
example, f is a pointer to a function that takes three arguments, then the expres¬ 
sion 


f(a, b, c) 


is equivalent to 


(*f) (a, b, c) 


The si zeof operator yields a value of type si ze_t. The type definition for si ze_t 
is given in stddef.h. 
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A pointer of type void * cannot be dereferenced without first casting it to an 
appropriate type. However, it can be used in logical expressions, where it is com¬ 
pared to another pointer. 
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C.6 Functions 


.-afflteJT 


ANSI C provides a new function definition syntax. A parameter declaration list 
occurs in the parentheses following the function name. An example is 


int f(int a, float b) 

{ 


m 


In contrast, the traditional C style is 

int f(a, b) 
int a; 

float b; 

{ 


ANSI C provides the function prototype, which is a new style of function declara¬ 
tion. A parameter type list occurs in the parentheses following the function name. 
Identifiers are optional. For example, 

int f(int, float); ancl int f(int a, float b); 
are equivalent function prototypes. In contrast, the traditional C style is 

int f(); 

If a function takes no arguments, then voi d is used as the parameter type in the 
function prototype. If a function takes a variable number of arguments, then the 
ellipsis is used as the rightmost parameter in the function prototype. 

Redeclaring a parameter identifier in the outer block of a function definition is ille¬ 
gal. The following code illustrates the error: 

void f(int a, int b, int c) 

{ 

int a; /* error: a cannot be redefined here */ 


Although this is legal in traditional C, it is almost always a programming error. 

Tnri ppH it ran hp a difficult liner to finrl 

Structures and unions can be passed as arguments to functions, and they can be 
returned from functions. The passing mechanism is call-by-value, which means that 
a local copy is made. 






C.7 Conversions 


■ An expression of type fl oat is not automatically converted to a doubl e. 

■ When arguments to functions are evaluated, the resulting value is converted to the 
type specified by the function prototype, provided the conversion is compatible. 
Otherwise, a syntax error occurs. 

■ Arithmetic conversions are more carefully specified. (See Section 3.11, “Conver¬ 
sions and Casts,” on page 131.) In ANSI C, the basic philosophy for conversions is to 
preserve values, if possible. Because of this, the rules require some conversions on 
a machine with 2-byte words to be different from those on a machine with 4-byte 
words. 

■ The resulting type of a shift operation is not dependent on the right operand. In 
ANSI C, the integral promotions are performed on each operand, and the type of 
the result is that of the promoted left operand. 


C.8 Array Pointers 

■ Many traditional C compilers do not allow the operand of the address operator & to 
be an array. In ANSI C, since this is legal, pointers to multidimensional arrays can 
be used. Here is an example: 

int a[2][3] = {2, 3, 5, 7, 11, 13}; , 

int C*p)[][3]; /* the first dimension 

need not be specified */ 

p = &a; 

printf(''%d\n”, (*p)[l][2]); /* 13 is printed */ 




C.9 T Structures and Unions 699 



C.9 Structures and Unions 

H Structures and unions can be used in assignments. If si and s2 are two structure 
variables of the same type, the expression si = s2 is valid. Values of members in 
s2 are copied into corresponding members of si. 

■ Structures and unions can be passed as arguments to functions, and they can be 

returned from functions. All arguments to functions, including structures and 
unions, are passed call-by-value. 

■ If m is a member of a structure or union and the function call f () returns a struc¬ 
ture or union of the same type, then the expression f (). m is valid. 

a Structures and unions can be used with the comma operator and in conditional 
expressions. Some examples are 

int a, b; 

struct s si, s2, s3; 


(a. si) /* comma expression having structure type */ 

a < b ? si : s2 /* conditional expression struct type */ 

a If expr is a structure or union expression and m is a member, then an expression of 
the form expr. m is valid. However, expr. m can be assigned a value only if expr can. 
Even though expressions such as 

(si = s2). in (a, sl).m (a < b ? si : s2).m f().m 

are valid, they cannot occur on the left side of an assignment operator. 
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CIO Preprocessor 

■ Preprocessing directives do not have to begin in column 1. 

■ The following predefined macros have been added: 

_DATE_ _FILE _ __LINE__ __STDC__ __TIME„ 

They may not be redefined or undefined. (See Section 8.9, “The Predefined Macros,” 
on page 387.) 

■ a macro may not be redefined without first undefining it. Multiple definitions are 
allowed, provided they are the same. 


The preprocessor operators # and ## have been added. The unary operator # causes 
the “stringization” of a formal parameter in a macro definition. The binary operator 
## merges tokens. (See Section 8.10, “The Operators # and ##,” on page 387.) 


■ The preprocessor operator def i ned has been added. (See Section 8.8, Conditional 
Compilation,” on page 384.) 


B The preprocessing directives #elif, terror, and #pragma have been added. (See 
Section 8.8, “Conditional Compilation,” on page 384, and Section 8.12, “The Use of 
#error and #pragma,” on page 389.) 


a in traditional C, toupperQ and tolowerC) are defined as macros in ctype.h: 


#define toupper(c) ((O-'a' + 'A 1 ) 

#define tolower(c) ((c)-'A'+'a') 

The macro call toupper(c) will work properly only when c has the value of a low¬ 
ercase letter. Similarly, the macro call tolowerCc) will work properly only when c 
has the value of an uppercase letter. In ANSI C, toupperQ and tolowerO are 
implemented either as functions or as macros, but their behavior is different. If c 
has the value of a lowercase letter, then toupper(c) returns the value of the corre¬ 
sponding uppercase letter. If c does not have the value ol a lowercase letter, then 
the value c is returned. Similar remarks hold with respect to tolowerO. 

m In ANSI C, every macro is also available as a function. Suppose stdio.h has been 
included. Then putchar(c) is a macro call, but (putchar) (c) is a function call. 
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.11 Header Files 

b ANSI C has added new header files. The header file stdlib.h contains function proto¬ 
types for many of the functions in the standard library. 

a The header files float.h and limits.h contain macro definitions describing implemen¬ 
tation characteristics. ANSI C requires that certain minimum values and ranges be 
supported for each arithmetic type. 


Cl 2 Miscellaneous 

m In traditional C, the operators += and -+ are synonymous, although the use of =+ is 
considered old-fashioned. In ANSI C, the use of =+, =*, and so on is not allowed. 

m An ANSI C compiler treats an assignment operator such as +- as a single token. In 
traditional C, since it is treated as two tokens, white space can occur between the + 
and =. Thus, the expression a + = 2 is legal in traditional C but illegal in ANSI C 

m Each of the following has a distinct name space: label identifiers, variable identifi¬ 
ers, tag names, and member names for each structure and union. All tags for enum, 
struct, and union comprise a single name space. 

m Two identifiers are considered distinct if they differ within the first n characters, 
where n must be at least 31. 

H The expression controlling a switch statement can be any integral type. Floating 
types are not allowed. The constant integral expression in a case label can be any 
integral type, including an enumerator. 

m Pointers and i nts are not interchangeable. Only the integer 0 can be assigned to a 
pointer without a cast. 

m Pointer expressions may point to one element beyond an allocated array. 
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Appendix D v ASCII Character Codes 


How to Read the Table 


Observe that the character A is in row six, column five. This means that the char¬ 
acter A has value 65. 


Some Observations 

■ Character codes 0 through 31 and 127 are nonprinting. 

■ Character code 32 prints a single space. 


Character codes for digits 0 through 9 are contiguous, letters A through Z are 
contiguous, and letters a through z are contiguous. 


The difference between a capital letter and the corresponding lowercase letter is 
32. 


The meaning of some of the abbreviations 


tael 

audible bell 

ht 

horizontal tab 

tas 

backspace 

nl 

/A T1 T l l /A 

newline 

er 

carriage return 

nul 

null 

esc 

escape 

vt 

vertical tab 


Note: On most UNIX systems, the command man ascii causes the ASCII table to be 
printed on the screen in decimal, octal, and hexadecimal. 





The table below shows precedence and associativity for all the C++ operators. In case of 
doubt, parenthesize. 


Operators 

Associativity 

() [] -> . ++ (.postfix') -- (postfix) 

loft f/\ rinlif 
id l lU ligiil 

++ ( nrc>fix ) — ( vrefix) i - sizeof (hvub) 

+ (unary) - (unary S) & (address) * (indirection) 

right to left 

* / % 

ieii lu ngm 

+ 


« » 

left to right 

ft* ft* ft*,* <&- .ft, *Pv *ft, ft* 

II 

A 

A 

II 

V 

V 

left to right 

~ mm * mmm 

left to right 

Si 

left to right 

A 

Ipf't tn ri o'I t f 

JLvUJL L t w 1.1 lit 

1 

left to right 

Mi 

left to right 

ii 

left to right 

? * 

right to left 

= += -= Vf = /= *= »= <<= Si- A= | = 

right to left 

, ( comma operator ) 

left to right 



















































Symbols 

! negation operator, 
147-148, 154—155 
!= not equal operator, 125, 
147, 152-154 

# preprocessor directive, 7, 

ij, yi, joD 

# preprocessor operator, 

387-388, 700 
## preprocessor operator, 
387-388, 700 
% modulus operator, 52, 
81-82 

%% format, 494 
%= modulus equal operator, 
88 

& address operator, 19, 44, 
77 748 751 428 

/ J Cm | i£m X U | » / 1 J I fc*. U 

& and (bitwise) operator, 

’loo 334 

d d (I* i d d jt 

&& and (logical) operator, 
147, 154-157 
&= and equal operator, 88 
() function call operator, 8 , 
207 

() parentheses, 7,12, 34, 
41, 71, 82-83,125,181, 
198, 200, 248, 368 
(...) ellipsis, 30, 202 

# dereferencing or indirec¬ 

tion operator, 44, 248, 
750 758 78Q 


*= times equal operator, 88 
+ plus (unary) operator, 696 
+ plus operator, 84 
++ increment operator, 
85-86 

+- plus equal operator, 88 
, comma operator, 82, 
171-172, 699 

- minus operator, 84, 90 

— decrement operator, 

85-86 

-= minus equal operator, 88 
-> member access operator, 
411,413-414,428 
. member access operator, 
408, 411, 414 

/* V comment pair, 11, 71, 
75-76 

// comment, 76, 98 
// comment (C++), 594 
// comment (Java), 626 
/= divide equal operator, 88 
; semicolon terminator, 8, 
11, 63, 82, 87, 108, 158 

< less than operator, 147, 

149-151 

< redirect input, 27, 56-57 
« left shift operator, 332, 

335-336 

« put to operator (C++), 
595 


«= shift left equal opera¬ 
tor, 88 

<= less than or equal opera 
tor, 23, 147, 149-150 
<> angle bracket, 7 
= assignment operator, 10 , 
12,87-88, 108-109 
== equal operator, 21 , 95, 

i ,47 *1 r* o 1 TO 
1 4 / 7 1 3 I 3 3 

> greater than operator, 

147, 149-150 

> redirect output, 56-57 
>» greater than or equal op 

erator, 147, 149-150 
» get from operator (C++), 
597 

» right shift operator, 332, 

$■**4 |M*» 

335-336 

»= shift right equal opera¬ 
tor, 88 

?: conditional expression 
operator, 182-183 
[] subscript operator, 37, 
245-248 253 262 

tmnr V L/ 4hm -JL ^ tmtr. V.* W* ^ iCmw V.' *— 

\ backslash, 79-80, 106, 
112-113 

\" double quote, 8 , 72, 
80-81, 113 

\? question mark, 113 
V single quote, 113 
\0 end-of-string sentinal, 
41-42, 113; 270-271 
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a exclusive or (bitwise) oper¬ 
ator, 332, 334 
or equal (exclusive) oper¬ 
ator, 88 

_ underscore, 78-79 
{ } braces, 7, 22, 82, 108, 
157,280-282 

| or (bitwise) operator, 332, 
334 

| vertical bar as choice sep¬ 
arator, 73-74 

| - or equal (inclusive) oper- 
aior, oo 

| | or (logical) operator, 147, 
154-157 

- complement operator, 
332-333 
0 zero, 148, 223 

A 

\a alert, 113-114 
a.out, 6, 54-55, 523 
abc, 43 

abc structure, 428-429 
abortO, 389, 670 
absQ, 130, 648, 666 
abstract base class (C++), 
611 

abstract data type, 430 
access, 429 
clock, 676 
date functions, 675 
file, 513, 517 
file position indicator, 
656 

random, 513 
structure, 411, 417 
time functions, 675-676 
acos(),647 
Ada, 77 
addQ, 283 
add_vector(), 562 

add3, 599-600 
add3Q, 599 
addition, 12 
address, 208, 250 


address operator &, 19, 44, 
72, 248, 251,428 
ADT, 430 
in Java, 629 
list, 447-460 
matrix, 577-579 
shape (C++), 610-612 
stack, 430, 432-435, 
460-463 

stack (C++), 613—615 
string (C++), 601-607 
student (C++), 609 
aggregate variable, 407 
alert \a, 113-114 
ALGOL 60, 73 
algorithm 

binary search, 664 
quicker sort, 665 
allocation 

71, 577, 663 
dynamic, 571 
memory, 254, 259, 577, 
663, 671 

and (bitwise) operator 
332, 334 

and (logical) operator &&, 
147, 154-157 
and equal operator &=, 88 
angle bracket <>, 7 
ANSI, 1, 3 
ANSI C 

floating types, 119 
vs, traditional, 693-702 
applet (Java), 635 
appletviewer (Java), 636 
or, 105, 526 
or command, 526 
archiver, 105, 526 
argc, 48, 290-291 
argument, 29, 201 
call-by-reference, 252 
function, 293-295 
macro, 368-370, 377 
structure, 416 
to mainQ, 48, 290 
variable, 651 
argv, 290-291 


arithmetic 

conversion, 131 
expression, 131 
operator, 81 

pointer, 36, 254-255, 257 
type, 111 

array, 37, 46, 245-308 
allocation, 571, 577 
as parameter, 256 
base address, 254 
character, 36, 39, 80 
element size, 255 
function argument, 256 
in structures, 409 
initialization, 246-247, 
260, 281, 695 
merge, 263 
multidimensional, 
277-290, 698 
name, 42 

of pointer, 284, 572 
of structures, 409 
pointer, 253, 698 
ragged, 292-293 
size, 246 

subscript range, 575 
three-dimensional, 280 
two-dimensional, 

277-278 

ASCII Character Codes, 111, 
703 

asctimeO, 677 

asinQ, 647 
asm (Borland), 77 
assertQ, 212-213, 388, 
641 

assert h f 212, 267, 388, 641 
assigm_values(),420 
assignment operator =, 10, 
12, 87-88, 108-109 
assignment operators, 

87-89 

associativity, 83-84, 148, 
150, 332,415, 705 
atanQ, 647 
atan2(), 647 
atexitQ, 670 


:>■ 

“$r. 


atof()»666 

atoi(), 667 
atolO, 667 
auto, 77, 205, 216-217 
automatic aggregate initial¬ 
ization, 695 
automatic conversion, 
132-133 
averageO, 599 
awk utility, 540 

B 

backslash \, 79-80, 106, 
112-113 

backspace \b, 113 
Backus-Naur Form, 73 
backward , 514 
base class (C++), 608, 616 
binary operator, 82, 148 
binary search algorithm, 

664 

binary tree, 475-483 
creation, 478-479 
traversal, 477-478 
binaryJree, 476, 480 
bisection, 296-297, 299 
bisectionC), 298, 304 
bison utility, 540 
bit, 429 

bit field, 427-429 
bit shift operator, 335 
biLprint , 338 
bit_print(),338-339 
bitwise logical operator, 334 
bitwise operator, 331-344, 
4?9 

block, 108, 157, 198, 
213-214 
C++, 595 
debug, 216 
nested, 215 
parallel, 215 
storage allocation, 215 
BNF, 73 
booLvals, 170 
boolean variable, 169 


Borland keywords 
asm, 77 
cdecl, 77 
far, 77 
huge, 77 
interrupt, 77 
near, 77 
pascal, 7/ 
bound, 246 

braces, 7, 22, 82, 108, 157, 
280-282 

break, 77, 179, 182 
bsearchf), 664 
bubble sort, 38, 257-258 
bubbleQ, 257 
buiIdtreeC),483 
byte, 136 

C 

ANSI, 1, 3 
standard, 1, 3 
traditional, 1 
%c format, 494 
.cfile extension, 6, 54-55, 
209, 523, 527, 532-533 
C system, 69, 91-92 
C++, 2-3, 593-616 
ADT, 601 ' 

benefits, 615 
class, 601 
constructor, 606 
destructor, 606-607 
exception, 614 
function, 599 
inheritance, 608 
input, 595 
output, 594 
overloading, 603 
polymorphism, 610 
template, 612 
call-by-reference, 36, 208, 

£** Lit L J \ ' 

call-by-reference (C++), 
599-600 


Call-by-reference is accom- 
plished, 253 
call-by-value, 33, 35, 
207-208 

callocO, 259-260,482, 
573,663-664 

cali$y 651 

capitalize , 126 
card structure, 407-409, 
419 

carriage return \r, 113 
case, 77, 181-182, 347, 694 
cast, 38, 131, 133-134 
cast conversion operator, 
134 

catch (C++), 615 
cb utility, 539 
cc command, 6, 53-54, 211 
cdecl (Borland), 77 
ceil(), 648 
change^case, 515 
char, 77, 110-111, 114-115 
CHAR3IT, 339 
character, 70 
array, 39, 80 
bit representation, 114 
constant, 79, 344 
conversion, 19-20, 494 
functions, 642 
input/output, 658 
mapping, 643 
multibyte functions, 668 
nonprinting, 112 
printing, 111 
testing, 642 
white space, 70 
check^bits, 429 
circle class (C++), 611 
class (C++), 601 
abstract base, 611 
base, 608, 616 
container, 612 
initialization, 606 
type, 601 

class (java), 625,627, 629 
method, 630 











710 


T Index 


▼ Index 711 



classes (C++) 
ci rcle, 611 
grad_student, 609 
rectangle, 611 
square, 612 
stack, 613 

string, 601, 603-604, 
606 

student,609 
classes (Java) 

Improved, 626 
Moon, 628 

NoSuchNameException, 

636 

Person,629 
Personl, 634 
Person2, 637 
PersonTest, 630 
Student,633-634 
StudentTest, 634 
wgcd, 635 
clearerrO, 658 
client (Java), 630 
clock access functions, 676 
clockQ, 104, 528, 530,676 
close(), 681 
closing a file, 655 
cmpO, 373-374, 376 
cnLchar , 165-166 
cntjetters , 49-50 

cntr1_c__handler(), 

565-566 
coercion, 132 
comma operator ,, 82, 
171-172, 699 
command line, 48 
command option, 55 
commands 
6 ?r, 105, 526 
cc, 6, 53-54, 211 
echo , 583 
lib , 526 
Is , 56 
mv, 54 
rename , 55 

frr s 3 ^ ^ 

tUb 9 526 


comment, 75 
comment (C++)//, 594 
comment (Java) //, 626 
comment //, 76, 98 
comment pair /* */, 11,71, 
75-76 

common definitions, 652 
compareQ, 354, 372 
compared ractional_ 
part(), 381 
compare^sorts , 534 
comparison function, 372 
Compilation Process figure, 
70 

compiler, 6, 53-55, 69, 91, 

K O r ) 

L..4F *«>|» | » ■—I 

conditional, 384-385 
error, 54 

math library use, 130 
option, 130, 370, 523 
profiler, 524 
complement operator 

QQ r > QQQ 
d d (U J) d 

complex , 413-414 
complex structure, 413 
compound statement, 22, 

1 |MM> 

a/ 


concurrent process, 567, 
681 

concurrenLsum , 561, 563 
conditional compile, 
384-385 

conditional expression op¬ 
erator ?:, 182-183 
conditioning a file, 655 
consecutive^sums , 23 
const, 1/, 77, 272, 
307-308, 596, 695 
constant, 79, 694 
character, 79 
enumeration, 79 
floating-point, 13, 79, 
119-120 

hexadecimal, 117, 134 
integer, 13, 79, 117 
multibyte character, 344 
negative, 80 


numeric, 694 
octal, 117, 134 
pointer, 46, 308 
string, 8, 80-81, 270, 
694-695 

symbolic, 14, 367, 385 
constructor (C++), 606 
overload, 607 
constructor (Java), 631 
Constructs not to be point¬ 
ed. at, 251 

container class (C++), 612 
continue, 77, 179-180, 182 
control string, 12, 18 
control-c, 56, 564 
eontrol-d, 56 
control-z, 114 
conversion, 10, 13, 38, 133, 
200, 208, 251, 698 
arithmetic, 131 
automatic, 132-133 
cast operators, 134 
character, 19-20, 494 
implicit, 132 
specification, 12,18,495, 
502 

cos(), 127, 647 
coshQ, 647 
countQ, 455 
count_it(), 456 
create^employee„da- 
ta(), 344 

create_tree(), 479 
Creating maxmin Program, 
30 

csh utility, 540 
ctimeQ, 677 
ctypeM, 382, 641-642 


D 

#define, 13-14, 16, 
366-368 
%d format, 494 
dangling else, 162 
data, 468 

data hiding (C++), 601 


data hiding (Java), 630 
data structure, 465 
date, a 18 

date functions, 675 
date structure, 410 

_DATE_, 387, 700 

dbLout , 57 
dbLspace, 507-508 
dbLwith^caps , 510 
dbx utility, 539 
deadlock, 567 
deal_the_cards(),423 
debug, 91, 216, 370, 539 
decimal, 137 
decimal integer, 80 
decimal point, 13 
declaration, 11, 107-108, 
110, 122, 128, 695 
external, 702 
declarations 

const, 17, //, 272, 
307-308, 596, 695 
typedef, 77, 122, 282, 
371,408,410,651 
volatile, 77, 307-308, 
695 

decrement operator 
85-86 


default, 77, 182 
default initialization, 223 
def i ned preprocessor oper¬ 
ator, 700 

delete (C++), 60/ 
demotion, 133 
dependency line, 532, 537 
Hpnf tn i c 1 11 re 416 

v*. W V I vW„: v V~ if I i. V/ 

dequeue(),472 
dereferencing or indirection 
operator *, 44, 248, 

250, 253, 289 

derived type, 245, 408, 424 
descriptor, 514, 681 
design 
OOP, 609 
top-down, 204 
destructor (C++), 606-607 
diagnostics, 641 


diff utility, 539 
difftimeQ, 677 
digit, 70 
dining , 568 

Dining philosophers, 567 
direct input/output, 662 
directory, 209 
discriminant, 191 
dissection 
abc , 43 

bit_print(), 3 3 9 
change^case, 515 
cnLchar, 166 
cntjetters , 50 
compare_fractional_ 
part(), 381 
concurrenLsum , 563 
consecutive^sum , 23 
dbLspace , 508 
dbLwith_caps, 511 
double^out, 124 
fai 1 (), 412 
fib_signal, 566 
fibonacci , 176 
flmLroots, 305 
go Jay shell script, 581 
improved, 594, 626 
inLor_ float, 425 
mainC) in sorLwords, 

90 r 
w O D 

makefile for compare „ 
sorts , 534 
marathon , 11 
maxmin , 32 
mergesortO, 266 
mLkm, 598 
moon , 596, 628 
nice^day, 40 
output from gojry , 583 
pacific., sea, 16 
play„poker(),422 
pow_ofJ2 , 90 
pmjrand, 94 
recursive function 
move(), 232 
running_sum f 27 
sea , 7 


sort.h Header File, 378 
sqrLpow , 128 
stack, 432 
strcatQ, 276 
strcpyC), 2/4 
string functions, 'Smw ^ 
string_to„list(),452 
sum , 71 
swap(), 253 
try_qsort, 374 
unpack(), 312 
wrLbkwrds , 226 
div(), 666 

divide equal operator /=, 88 
dMsion, 12 
do, 77, 172-173, 367 
domain error, 644, 646-648 
dot_product(), 283 
double, 77, 110-111, 119, 
693 

double quote \", 8, 72, 
80-81, 113 
double_out , 124 
double_space(), 507 
dynamic allocation, 571 

E 

%e format, 494 

file extension, 6, 55 
e-format, 17 
EBCDIC, 111 
echo , 20 

echo command, 583 
editor, 54 
efficiency, 227 
eigen value, 571 
elem structure, 465, 471 
#elif, 386, 700 
ellipsis (.„), 30, 202 
#e1 se, 386 

picp 77 

empty statement, 158 

emptyO, 432,434, 462,473 

end-of-ffle, 51, 56, 125, 654 
end-of-string sentinal \0, 
41-42, 113, 270-271 
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#endif, 384,386 
end! (C++), 595 
enqueueC),473 
enum, 77, 694 
enumeration, 345-355 
constant, 79 
type, 131. 694 
environment functions, 665 
environment variable, 
521-522 
EOF, 51, 56, 125 
equal operator ==,21,95, 
147, 152-153 

equality operators, 147-148 
errnoJ'h 641, 643, 695 
#error, 389, 700 
error handling, 658 
error_exit(), 562-563 
e r ro r_exit_cal 1 oc_ 
failedQ, 289 
error_exit„too_many_ 
words(),289 
e r ro r„exit_wo rd_too_ 
longO, 290 
errors 

#def i ne with semicolon, 
369 

%lf vs. %f, 129 
—— vs = 367 
assignment vs, equality, 
153 

compile-time, 54 
dangling else, 162 
domain, 644, 646-648 
functions, 643 
header, 701 
infinite loop, 164, 168, 
174 

integer precision, 225 
range, 644, 647 
run-time, 54-55 
syntax, 54-55 
escape sequence, 112, 114 
escape sequence, hexadeci¬ 
mal, 695 

evaluateQ, 467 
exception, 564 


exception (C++), 614 
exception (Java), 636 
exclusive or (bitwise) opera¬ 
tor A, 332, 334 
exec... 0,558, 682 
execl (), 558, 680, 682 
execleQ, 682 
execlp(), 682 
execlpeQ, 682 
executable file, 54 
execution speed, 219 
execvQ, 682 
execve(), 682 
execvpC), 682 
execvpeQ, 682 
exitQ, 27, 670 
expO, 127, 647 
exponential notation, 
119-120 

expression, 10,13,107-109, 
696 

arithmetic, 131 
evaluation, 13 
mixed, 38, 133 
pointer, 255, 701 
statement, 158-159 
extern, 77, 205, 216-219, 
231, 308 

external declaration, 702 
external identifier, 79 
external variable, 199 


%f format, 494 
\f formfeed, 113 
fO 696 

fabs (), 130,648 
factorial(), 225 
fail 0,412 
false, 21, 147-148 
far (Borland), 77 
fclose(), 505, 509, 655 
fct, 210, 295 
feof 0,658 

ferrorO, 658 

fflushO, 655 


fgetc(),659 
fgetposO, 656-657 
fgets(), 658-659 

fibO. 557 

fib^signal, 565-566 
fibonacci , 175-176 
fibonacci(), 227 
field, 19 

field width, 495, 497 
FIFO, 473 
figures 

Array after swapping, 

288 

Array before swapping, 
288 

Array of words, 287 
Compilation Process, 70 
Creating a .h file, 209 
Dining philosophers, 567 
Finding a root by bisec¬ 
tion, 297 

Kepler equation solution, 
300 

Linked List, 448 
Pointer Use, 249 
Ragged array, 293 
FILE, 47, 503, 515, 653 
file, 47, 47-53 
access, 513, 517 
closing, 655 
conditioning, 655 
descriptor, 514, 681 
executable, 54 
header, 7, 14, 91 
mode, 506 
opening, 655 
pointer, 47 
pointer to, 50 
position indicator, 656 
remove, 662 
rename, 662 
source, 5, 534 
standard header, 14 
temporary, 510 

_FILE_, 387, 700 

file!, 218 
file 2 , 218 
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tiles 

.c, 6 , 54-55, 209, 523, 
527, 532-533 
.exe. 6 , 55 

.1% 14, 91, 209, 532, 538 
.o, 211, 523, 532-533, 
538 

.objt 211 
a.oiit , 6 

FILL()» 377, 379 
fill (),468 
filLarray , 260-262 
fill„array(), 261, 373, 
376 

find„next_dayO, 

347-348 

find m pivot(), 392 
find_roots, 297-299, 
303-305 
flex utility, 540 
float, 77, 110-111, 119 
floath , 641, 644, 701 
floating types, 17, 111, 119 
floating-point constant, 13, 
79, 119-120 

floating-point exception, 
564 

floorO, 648 
flow of control, 21-28, 
147-184 

flower structure, 426 
flush (C++), 595 
fmodQ, 648 

fopenQ, 47, 505-506, 512, 
655 

for, 25, 77, 167-169, 171 
fork Of 556-557, 560, 563, 
681 

formal parameter, 34, 198, 
296 
formats 
%%, 494 
%c, 494 
%d, 494 
%e, 494 
%f , 494 
%Q f 494 


%g f 494 
%i, 494 
%n, 494 
%o, 494 
%p, 494 
%s, 494 
%u, 494 
%x, 494 

formatted input/output, 18, 
660 

formfeed \f, 113 
fprintfOf 52, 503, 660 
fputcO, 659 

fputs(), 659 

fread Of 662 

free(), 459, 664 
freopenC),6 d 5 
frexpO, 648 
friend (C++), 603 

frontO. 473 

fruit structure, 409, 426 
fscanf(), 503-504, 661 
fseek(), 513, 655-657, 660 
fsetposO, 655-657, 660 
ftell 0,513, 656-657, 660 
ftp , 211 

full (),432, 434, 463,473 
func(),650 

function, 29-36,197-233, 
696 

argument, 29, 201 
argument to mai n (), 48 
array argument, 256 
array parameter, 279 
as argument, 293-295 
body, 197-198 
C++, 599 

call operator (), 8 , 207 
call-by-reference, 36, 

208, 252-253 
call-by-reference (C++), 
599-600 

call-by-value, 33, 35, 
207-208 
character, 642 
clock, 676 

common definitions, 652 


comparison, 372 
concurrent process, 681 
date, 675 

declaration, 204-205 
definition, 197-200, 
204-206, 256, 696 
diagnostic, 641 
environment, 665 
error, 643 
friend (C++), 603 
general utilities, 663 
graceful, 510 
header, 197 
inline (C++), 598 
input/output, 653 
integer arithmetic, 666 
interprocess communica¬ 
tion, 683 

invocation, 198, 204, 
207-208 
iteration, 225 
Java, 627 
jumps, 649 
localization, 645 
mathematical, 127, 646 
member (C++), 601 
member (Java), 630 
memory allocation, 259, 
577, 663, 671 
memory handling, 671 
multibyte character, 668 
multibyte string, 669 
order, 206 

overlaying process, 682 
overload (C++), 610 
overloading (C++), 603 
override (Java), 633 
parameter, 34, 198-199, 
202, 296 

program leaving, 670 
program suspend, 683 
prototype, 18, 29, 32, 92, 
201-205, 697 
pseudo random number, 
665 

pure virtual (C++), 611 
recursion, 223-232 
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function (cont’d) 
return, 200 
scope, 222 
searching, 664 
signal handling, 650 
sorting, 664 
starting point, 7, 29, 48 
storage class, 205 
string handling, 272-273, 
666, 671 

string multibyte, 669 
time, 675-676 
type, 198-199 
variable argument, 651 
functions and macros 
abort0, 389, 670 
abs(), 130, 648, 666 
acosO, 647 
add(),283 
add_vector(),562 
add3(), 599 

asctimeO, 677 

asin(), 647 

assert(),212-213, 388, 
641 

assign_valuesO, 420 
atan(), 647 
atan2(), 647 
atexit(), 670 
atof(), 666 
atoiQ, 667 
atolO, 667 
averageQ, 599 

bisect! on O, 298, 304 
bit_print(), 338-339 
bsearchC),664 
bubble(), 257 
buildtreeO, 483 
callocC),259-260,482, 
573, 663-664 
cei1(), 648 

clearerrO, 658 
clockO, 104, 528, 530, 

676 

closeO, 681 
cmpO, 373-374, 376 


cntrl_c_handl er(), 

565-566 

compareO, 354, 372 
compare_fractional_ 
part(), 381 
cosO, 127, 647 
coshO, 647 
countC), 455 
count_itQ, 456 
create_employee_da- 
ta(), 344 

create_treeC), 479 
ctime(), 677 
deal_the_cardsO, 423 
dequeuef), 472 
difftimeO, 677 
div(), 666 
dot_product(), 283 
double_space(), 507 
empty(),432, 434, 462, 
473 

enqueueO, 473 
error_exit(), 562-563 
error_exit_cal1oc_ 
failedO, 289 
error_exit_too_many_ 
wordsO, 289 
error_exit_word_too_ 
long(), 290 
evaluateO, 467 
exec...(), 558, 682 
execlQ, 558, 680, 682 
execleO, 682 
execlpO, 682 
execlpeO, 682 
execvQ, 682 
execvef),682 
execvpQ, 682 
execvpeC), 682 
exitO, 27, 670 
expO, 127, 647 
fO, 696 
fabs(),130,648 
factorial(), 225 
fai 1 O, 412 

fcloseO, 505, 509, 655 
feof(),658 


ferror(), 658 

fflushO, 655 

fgetc(), 659 
fgetposO, 656-657 
fgetsQ, 658-659 
fibO, 557 
fibonacciO, 227 
FILLO, 377, 379 
fill(), 468 

fill_array(), 373, 376 

fi nd_next_dayO, 

347-348 

find_pivot(), 392 
floor(), 648 
fmodO, 648 
fopen(), 47, 505-506, 
512, 655 

forkO, 556-557, 560, 
563,681 

fprintf O, 52, 503, 660 
fputcO, 659 
fputs(), 659 
fread(), 662 
f ree(), 459, 664 
freopenO, 655 
frexp(), 648 
frontO, 473 
fscanf(), 503-504, 661 
fseekO, 513, 655-657, 
660 

fsetposO, 655-657, 

660 

ftellO, 513, 656-657, 
660 

full(), 432, 434, 463, 
473 

funcO, 650 

fwriteO, 662 

get_matrix_spaceO, 

578 

get_n_from_user(), 

230 

get_vector_spaceO, 

575 

getcO, 51, 382, 658 
getcharO, 39-40, 
124-125, 382, 658 


getenvO, 665 

getsO, 658 
gfopenO, 511 
gmtime(), 677 
ini tO, 614 
init_gnode(), 481 
init_node(),479 
i nitialize 0,462,468, 
472 

inorderO, 477 
isalnumO, 383, 642 
isa1pha(),40, 383, 642 
isascii(), 383 
iscntrl (), 383, 642 
isdigit(), 383, 642 
isflush 0,424 
isgraphQ, 383, 642 
islowerQ, 383, 642 
isprintO, 383, 642 
ispunctQ, 383, 642 
isspaceC), 271, 353, 
383, 642 

isupper(), 383, 642 
i sxdigit(), 383, 642 
keplerO, 302 
labsQ, 666 
1dexpC),648 
IdivQ, 666 
lexicoO, 382 
localtime(), 677 
logO, 127, 647 
logl0O. 647 
longjmpO, 649 

lrand480, 105 

main(), 7, 29,48 
mallocO, 259-260, 
663-664 
niaximum(), 31 
mblenO, 668-669 
mbstowcsO, 669 
mbtowcC),669 
memchrO, 671 
memcmpO, 671 
memcpyO, 671 
memmoveO, 671 
memsetC), 671 
mergeO, 264, 269 


mergesortC), 266,269 
message_for(), 387 
mktimeO, 677 
modfO, 649 
monitor(),524 
move(), 230 
multiplyOi 283 
NDEBUGO, 389, 641 
new„gnode(), 481 
open(), 681 
operator+O, 604-605 
pack(), 341 
partition(),393 
pclose(),520 
perror(), 644, 658, 672 

philosopherO, 570 

pick_up(), 569 

pipeO, 561, 683 

play_poker(), 421-422 
popO, 432-433, 462 
popen(),520 
postorderO, 478 
powQ, 29, 127, 648 
power(), 204 
preorderO, 478 
PRINTO, 377, 379 
print_list(), 456 
printfO, 18-20, 493, 
495, 660 
printlnO, 627 
prn_array(), 374 
prn__card_val ues(), 
421 

prn_dataO, 470 
prn_final_statusO, 

351 

p rn_garne m status(), 

351 

prn_help(),351 
prn_info(), 31, 508 
p rn_J nst ructi ons (), 

352 

prn_stack(), 470 
prn_tbl_of_powers(), 
203 

prn__tinie(), 529 
probabilityO, 222 


push(), 431, 433, 462, 
468 

put_downO, 569 
putc(), 52, 382, 659 
putcharO, 39-40, 
124-125, 382, 659 
puts(), 659-660 
qsortO. 372, 375, 379, 
381, 664-665 
quicksortO, 372, 392 
raise(), 650 
randO, 94-95,104, 353, 
376,422, 665 
randomO, 222 
read(), 681 
reallocO, 663-664 
release_matrix_ 

,577-578 

removeO, 519, 662 
rename(),662 
report„and„tabu~ 
late(), 355 
reset(), 431, 433 
reverseO, 614 
rewind(), 655-657, 660 
s__to_l 0,453 
scanfO, 18-21,499,661 
selection_by„ma- 

chineO, 352 
selection_by_p1aye r 

O 353 

setbuf(),656 
setjmpO, 649 
set! ocale 0,646 
setvbuf(), 656 
shuffleO, 423 
signal 0,564-566,570, 
650 

sinO, 127, 295, 647 
sinh(), 647 
sleepO, 683 
sort_words(), 288 
spawn...O, 560, 682 
• spawnl(), 560, 680 
sprintfO, 503-504 
sqrt(), 127, 648 
srandQ, 95,422, 665 
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functions and macros 
(cont’d) 

srand48(), 105 
sscanf(), 503-504, 661 
start„time(), 529 
strcat(), 273, 275-276, 
b71 

strchrO, 672 
strcmpQ, 273, 672, 675 
strcoll 0 , 672, 675 
strcpyQ, 44, 273-274, 
672 

strcspnO, 672 

strerrorO, 644, 672 
strftimeO, 678-679 
stri ng_to_list(), 452 
strlenQ, 273, 673 

strncatO, 673 

strncmpO, 673 

strncpyO, 673 

strpbrkQ, 673 
strrchr(), 673 
strspnQ, 674 
strstrQ, 674 
strtod(), 666-667 
strtokO, 674 
strtol0>667 
strtoul(), 668 

strxfrmO, 675 
sumO. 223, 256 
sum_array(), 261 
swap(), 252, 289,423 
systemO, 518-519, 665 
tanQ, 127, 647 
tanhQ, 647 
timeQ, 104, 422, 528, 
530, 676 

tmpfileO, 656, 670 
tmpnam(), 519, 656 
toasciiO, 383 
tolowerQ, 383, 643, 

700 

topQ, 432, 462 
toupper(), 383, 643, 

700 

ungetcC), 660 
unlinkO < Ci 6 *+ 


unpackO, 342 
va_arg(),651 
va__end(), 651 
va„start(), 651 
va^sumO, 652 
vfork(), 682 
vfprintf(), 661 
wait(), 570 
wcstombsC), 669 
wctomb() f 669 
word_cnt(), 272 
writeQ, 681 
wrt(),269 
wrt_array(), 262 
wrt_it(), 225 
wrt„words(), 290 
fundamental data type, 
107-137 
fwriteC), 662 


%C format, 494 
%g format, 494 
garbage, 223, 247 
gdbb utility, 539 
general utilities, 663 
generic pointer, 251, 460 
generic pointer voi d *, 694 
get from operator (C++) », 
597 

get„matrix„space(), 578 
get„ji_f rom__user(), 230 
get_vector_space(), 575 
getc(), 51, 382, 658 
getchar(),39-40, 

124-125, 382, 658 
getenvQ i 
getsQ, 658 
gfopenO, 511 
global, 199, 221 
gmtimeC),677 
go_try , 581, 583 
goto, 77, 178, 182, 649 
graceful function, 510 
gracLstudent class (C++), 
609 


greater than operator >, 

147,149-150 

greater than or equal opera¬ 
tor >=, 147, 149-150 
grep utility, 539 

H 

M file extension, 14,91, 209, 
532, 538 

handler (C++), 614 
handler (Java), 636 
header file, 7, 14, 91, 701 
hexadecimal, 117, 134-137 
escape sequence, 695 
integer, 80 
Hoare, A., 391 
home„address structure, 
418 

huge (Borland), 77 


#i ncl ude, 7, 13-16, 91-92, 
365-366 
%i format, 494 
identifier, 10, 16, 78-79, 
122 , 202 
external, 79 
label, 701 
variable, 701 
#if, 384, 386 
if, 21, 77, 159-162 
if-else, 22, 159-160, 162, 
166, 181, 183 
#i fdef, 384 
#i fndef, 384 
implicit conversion, 132 
improved, 594, 626 
Improved class (Java), 626 
#i ncl ude, 209 
increment operator ++, 
85-86 

indent utility, 539 
index 

See subscript\ 245 
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indirection 

See dereferencing , 248 
infinite loop, 164, 168, 174 
infix, 464 

inheritance (C++), 608 
inheritance (Java), 632 
init(), 614 
init„gnode(),481 
init„nodeQ, 479 
initialization, 223, 695 
array, 246-247, 260, 281, 
695 

automatic aggregate, 695 
class (C++), 606 
default, 223 
structure, 418, 695 
union, 695 
variable, 223 
initialize(), 462, 468, 
472 

inline (C++), 598 
inorderO, 477 
input, 18, 493-518 
keyboard, 19 
redirect <, 27, 56-57 
stdin, 503, 654 
stream, 19 
input (C++), 595 
input/output 
character, 658 
direct, 662 
formatting, 660 
functions, 653 
int, 77, 110-111, 116-118, 
122 

int_or„ float, 425 
integer, 7, 110-114, 

116-119, 124, 134,136 
arithmetic functions, 666 
constant, 13, 79, 117 
decimal, 80 
hexadecimal, 80 
octal, 80 
overflow, 117 
promotion, 131 
size, 116 


integral type, 111, 116-117, 
123 

interprocess communica¬ 
tion functions, 683 
interrupt, 56 
interrupt (Borland), 77 
iostream.h (C++), 594 
is_flush(), 424 
isalnumO, 383, 642 
isalphaO, 40, 383, 642 
isascii(), 383 
iscntrl (), 383, 642 
isdigitQ, 383, 642 
isgraphQ, 383 
islowerO, 383, 642 
isprintO, 383, 642 
ispunct() ? 383, 642 
isspaceQ, 271, 353, 383, 
642 

isupperC), 383, 642 
isxdigitO, 383, 642 
iteration, 225 
iterative action, 147 


Java, 2, 4, 625-638 
ADT, 629 
applet, 635 
appletviewer, 636 
benefits, 638 
class, 629 
constructor, 631 
exception, 636 
inheritance, 632 
output, 626 
overloading, 631 
override, 633 
polymorphism, 633 
type, 627 
variable, 627 
jump, 178, 649 

K 

kepler , 301-302 
keplerQ, 302 


keyboard input, 19 
keywords, 12, 77 
asm (Borland), 77 
auto, 77, 205, 216-217 
break, 77, 179, 182 
case, 77, 181-182, 347, 
694 

cdecl (Borland), 77 
char, 77, 110-111, 
114-115 

const, 17, 77, 2/2, 
307-308, 596, 695 
continue, 77, 179-180, 
182 

default, 77, 182 
do, 77, 172-173, 367 
double, 77, 110-111, 
119,693 
else, 77 
enum, 77, 694 
extern, 77, 205, 
216-219, 231, 308 
far (Borland), / / 
float, 77, 110-111, 119 
for, 25, 77,167-169,171 
goto, 77, 178, 182, 649 
huge (Borland), 77 
if, 21, 77, 159-162 
if-else, 22, 159-160, 
162, 166, 181, 183 
int, 77, 110-111, 
116-118, 122 
interrupt (Borland), 77 
long, 77, 110-111, 
117-119 

long double, 110-111, 
119, 693 

long float, 693 
near (Borland), 77 
pascal (Borland), 77 
register, 77, 205, 216, 
219-220, 651 
return, 8, 35, 77, 182, 
200-201 

short, 77, 110-111, 
117-118, 131 
signed, 77,110,131,693 
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signed char, 110-111, 
115 

signed int,110 
signed long int, lit) 
signed short int,110 
sizeof, 77, 122-123, 
259, 450 

static, 77, 205, 216, 
218,220-222 
struct, 77, 407-408 
switch, 77, 181, 694 
typedef, 77, 122, 282, 
371,408,410,651 
union, 77, 424-425 
unsigned, 77, 110-111, 
117-119, 131 
unsigned char, 
110-111, 115 
unsigned int,110 
unsigned long, 
110-111, 118-119 
unsigned long int, 110 
unsigned short, 
110-111 

unsigned short int, 
110 

void, 77, 198, 202, 693, 
697 

void *,694 
void*, 251,460 
volatile, 77, 307-308, 
695 

while, 23-24, 77, 
166-167, 172 
keywords (C++) 
catch, 615 
class, 601 
delete, 607 
friend, 603 
inline, 598 
new, 607 
operator, 603 
template (C++), 612 
throw, 614 
try, 614 
keywords (Java) 

class, 625, 627, 629 


label, 178, 701 
labs(), 666 
1 conv structure, 645 

IdexpO, 648 

ldiv(), 666 
leaf node, 475 
leaving program functions, 
670 

left shift operator «, 332, 
335-336 

less than operator <, 147, 
149-151 

less than or equal operator 
<=,23, 147,149-150 
letters, 70 
(ex utility', 540 
lexical element, 69-70 

lexicoO, 382 
lib command, 526 
librarian, 526 
libraries, 91, 526-527, 
641-683 

assert.h, 212, 267, 388, 
641 

ctype.h, 382, 641-642 
errno.h, 641, 643, 695 
float.h, 641, 644, 701 
iostream.li (C++), 594 
limits.h, 339, 641, 645, 
701 

locale.h, 641, 645 
math.h, 128, 641, 646 
process.h, 558 
setjmp.h, 641, 649 
signaLh, 564, 641, 650 
standard, 8, 92-93, 273, 
641-683 

stdarg.h, 641, 651, 661 
stddef.h, 371, 641, 652 
stdio.h, 7-8, 14, 16, 18, 
39, 92, 124-125, 382, 
641, 653 

stdlib.h, 27, 94, 259, 372, 
482, 641, 663 
string.h, 43, 272, 641, 

670 


time.h , 351, 422, 528, 
641,675 
UFO,463 

limits.h, 339, 641, 645, 701 
nine, 390 
line number, 390 
—LINE__, 387, 700 
linkedjist, 449, 453, 456 

1 i nked_li st structure, 
44Q 

J. X 

Jist, 447-483 

concatenation, 457 
deletion, 459 
insertion, 458 
linked, 479-483 
operation, 451 
print, 456 
processing, 455 
queue, 471 
stack, 460 
list structure, 447 
literal, 80 
loader, 54 

local variable, 199, 217 
locale.h, 641, 645 
localization, 645 
localtimeQ, 677 
locate, 250 
log(), 127, 647 
loglOO, 647 
logical bitwise operators, 
334 

logical operators, 147-148, 
154-156 

long, 77, 110-111, 117-119 
long double,110-111, 

119, 693 

long float, 693 
longjmpO, 649 
loop, 23, 56, 163-164, 168, 
174, 182 
loop, 174 
lower_case, 519 
lowercase letter, 70 
1rand48(), 105 
Is, 56 
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macro, 124, 368-370 
argument, 377 
predefined, 387 
macros 

_DATE_ # 387, 700 

_FILE_, 387, 700 

_LINE_, 387, 700 

_STDC_, 387, 700 

_TIME_, 387, 700 

mainQ, 7, 29, 48, 290 
make utility, 212, 338, 
532-538 

makefile , 212, 532-534 
mallocO, 259-260, 
663-664 
marathon , 11 
mask, 337, 429 
math.h , 128, 641, 646 
mathematical function, 127, 
646 

matrix, 278, 282, 571-572, 
575-577, 579 
matrix, 282-283, 577-578 
max_min , 32 
MAX_RAND, 378 
maxi mum(), 31 
maxmin , 30,32 
mblenQ, 668-669 
mbstowcsQ, 669 
mbtowcQ, 669 
member, 411 

member access operator ->, 
411,413-414,428 
member access operator ., 
408,411,414 

member function (C++), 601 
member function (Java), 630 
member name, 409, 701 
memchrO, 671 
memcmpO, 671 
memcpyO, 671 
memmove(), 671 
memory 

allocation, 254, 259, 577, 
663, 671 

handling functions, 671 


location, 250 
See also allocation , 217 
memory management (C++) 
delete, 607 
new, 607 
memsetQ, 671 
mergeQ, 264, 269 
mergassort, 263-264, 266, 
269 

mergesort , 266 
mergesortC), 266, 269 
message_for(), 387 
method, 627 
mLkm, 597-598 
minimumO, 31 
minus equal operator 88 
minus operator 84, 90 
.mixed expression, 38, 133 
mktimeO, 677 
mnemonic identifier, 367 
modf(), 649 

modulus equal operator %=, 

8 0 
o 

modulus operator %, 52, 
81-82 

monitorC), 524 
moon , 596, 628 
Moon class (Java), 628 
niove(), 230 
MS-DOS, 5, 53, 55-56 
mulctime , 530 
multibyte character con¬ 
stant, 344 

multibyte string functions, 
669 

multidimensional array, 
277-290, 698 
multiplication, 12 
mul tlplyC), 283 
multiprocess, 555 
mv command, 54 
my„echo , 291 
mystring, 601 

N 

%n format, 494 


\n newline, 8-9, 79, 113 
name space, 346, 409, 701 
narrowing, 133 
natural number, 116 
nawk utility, 540 
NDEBUCC), 389, 641 
near (Borland), 77 
negation, 154-155 
negation operator !, 
147-148, 154-155 
negative constant, 80 
nested block, 214 
new (C++), 607 
new_gnode(), 481 
newline \n, 8-9, 79, 113 
Newton-Raphson algorithm, 
196 

next_day , 347 
nibble, 135-136 
nice, day, 39-40 
no_change , 36 
node, 475 

node structure, 476, 480 
nonprinting character, 112 
nonzero, 148 « 

NoSuchNameExcept!on 
class (Java), 636 
not equal operator !=, 125, 
147, 152-154 
NULL, 148, 259-260, 371 
null character \0, 41-42, 
113, 270-271 
null string, 44 
number, 79 
numeric constant, 694 

o 

%o format, 494 

.0 file extension, 211, 523, 

5 r\ r\ p r\ p r\ q 

sz-odd, bio 
.obj file extension, 211 
object, 606 
object code, 54, 69 
object-oriented program¬ 
ming, 593, 615 
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octal, 117, 135-137 
constant, 134 
integer, 80 
offset, 653 
offset pointer, 262 
one's complement operator, 
333 

OOP, 593, 615 
OOP Design Methodology, 
609 

open(), 681 
opening a file, 655 
operating system, 53-57 
operator, 69, 82 
arithmetic, 81 
assignment, 87-89 
associativity, 83-84,148, 
150, 332,415, 705 
binary, 82, 148 
bit shift, 335 
bitwise, 331-344, 429 
bitwise logical, 3 3 4 
equality, 147-148 
logical, 147-148, 

154-156 

overload (C++), 603 
precedence, 83-84, 
148-150, 332,415, 

705 

relational, 147-151 
unary, 85, 148 
operator (C++), 603 
operator+C),604-605 
operators, 81 

address &, 19, 44, 72, 

248, 251,428 
and (bitwise) &, 332, 334 
and (logical) &&, 147, 
154-157 
and equal &=, 88 
assignment =, 10, 12, 
87-88, 108-109 
comma ,, 82, 171-172, 
699 

complement ~, 332-333 
conditional expression 
?:, 182-183 


decrement 85-86 
defined preprocessor, 
700 

delete (C++), 607 
dereferencing or indirec¬ 
tion *, 44, 248, 250, 
253, 289 

divide equal /=, 88 
equal ==, 21, 95, 147, 

I p* rj i rn 

:>z-1 :>.> 

exclusive or (bitwise) a, 
332, 334 

function call (), 8, 207 
get from (C++) », 597 
greater than >, 147, 
149-150 

greater than or equal >=, 
147,149-150 
increment ++, 85-86 
left shift«, 332, 
335-336 

less than <, 147,149-151 
less than or equal <=, 23, 
147, 149-150 
member access ->,411, 
413-414, 428 
member access .,408, 
411, 414 
minus -, 84, 90 
minus equal -=, 88 
modulus %, 52, 81-82 
modulus equal %=, 88 
negation !, 147-148, 
154-155 
new (C++), 607 
not equal !=, 125, 147, 
152-154 

or (bitwise) | J| | ^ Ut 
or (logical) | |, 147, 

154-157 

or equal (exclusive) a= 

88 

or equal (inclusive) | =, 88 
overload (C++), 610 
plus (unary) +, 696 
plus +, 84 
plus equal +=, 88 


put to (C++) «, 595 
right shift», 332, 
335-336 

shift left equal «=, 88 
shift right equal »=, 88 
sizeof, 77, 122-123, 
259, 450 
subscript [], 37, 
245-248, 253, 262 
times equal *=, 88 
or (bitwise) operator j, 332, 
334 

or (logical) operator | |, 147, 
154-157 

or equal (exclusive) operator 
A=, 88 

or equal (inclusive) operator 
I =, 88 
OS/2, 5 

output, 6-10,18, 493-518 
redirect >,56-57 
stdout,503,654 
output (C++), 594 
output (Java), 626 
overflow, 117 
overlaying a process, 682 
overload (C++), 603 
function, 603, 610 
operator, 610 
operator, 603 
operators, 603 
overload (Java) 
function, 631 
ovLstring, 604 

P 

%p format, 494 
pacific_sea, 15-16 
packO, 341 
pack_bits, 341 
parallel block, 215 
parameter 

array, 256, 279 
formal, 34, 198, 296 
list, 199, 202 
type list, 29 
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template, 612 
parentheses (), 7, 12, 34, 
41, 71, 82-83,125,181, 
198, 200, 248, 368 
partitionO, 393 
pascal (Borland), 77 
pcard structure, 427 
pcloseQ, 520 
perl utility, 540 
perrorO, 644, 658, 672 
person , 629 

Person class (Java), 629 
Personl class (Java), 634 
Person2 class (Java), 637 
PersonTest class (Java), 
630 

pgm.h, 210 
philosopher(), 570 
pick__up(), 569 
pipe, 520, 526, 683 
pipe(), 561, 683 
play„poker(),421-422 
plus (unary) operator +, 696 
plus equal operator +=, 88 
plus operator +, 84 
pointer, 36, 42, 46, 205, 
248-269,413-414 
arithmetic, 36, 254-255, 
257 

array of, 284, 572 
arrays, 253, 698 
call-by-reference, 252 
constant, 46, 308 
conversion, 251 
expression, 255, 701 
file, 47 

generic, 251, 460 
NULL, 148 
offset, 262 

to, 19, 44, 72, 248, 251, 
428 

to file, 50 
variable, 46, 248 
poker , 419 


Polish, notation, 464-466, 
468 

polish_stack , 465-468, 470 
polymorphic function (C++), 
610 

polymorphism (Java), 633 
polynomial, 296-297 
pop(),432-433, 462 
popenQ, 520 
postfix, 85-86, 274 
postorderC), 478 
pow(), 29, 127, 648 
pow„of^2 i 89-90 
power (),204 
power^square, 127 
#pragma, 389, 700 
precedence, 83-84, 

148-150, 332, 415, 705 
precision, 121, 495, 497 
predefined macro, 387 
prefix, 85-86 
preorderO, 478 
preprocessor, 7, 69, 91, 
365-394, 700 
#, 7, 13, 91, 365 
#def1 ne, 13-14, 16, 
366-368 
#elif, 386, 700 
#else, 386 
#endif, 384, 386 
#error, 389, 700 
#if, 384, 386 
#ifdef, 384 
#ifndef, 384 
#include, 7, 13-16, 
91-92, 209, 365-366 
#11ne, 390 
#pragma, 389, 700 
#undef, 384-385 
directives, 91 
operator #, 387-388, 700 
operator ##, 387-388, 
700 

Press, W., 105 
PRINTO, 377, 379 
prinLargs , 558-559 
pri nt_Ji st(), 456 


printenv, 521 

printfO, 18-20, 493,495, 
660 

printing character, 111 
printlnC) (Java), 627 
privacy, 221 
privacy (C++), 601 
prn_array(), 374 

prn_card_va1ues(),421 
prn_data(). 470 
prn_final_status(),351 
prn„game_status(), 351 
prnjielp()> 351 
prn_Jnfo(), 31, 508 
prn_Jnstructions(),352 
prn^rand, 93-94 
prn_$tackO, 470 
prn__tbl_of_powers(), 

203 

prn_time(), 529 
probabilityQ, 222 
process, 555 
process.h , 558 
profiler, 524 
program 

characters, 70 
leaving, 670 
suspension functions, 
683 

program development, 209, 
212 

promotion, 38, 131-133 
prototype, 18, 29, 32, 92, 
201-205, 697 
pseudo random-number 
generator, 665 
punctuator, 71, 81-82 
pure virtual function (C++), 
611 

push(), 431, 433, 462, 468 
put to operator (C++) «, 
595 

put_down(), 569 
putcQ, 52, 382, 659 
putcharO, 39-40, 
124—125, 382, 659 
put$(), 659-660 
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qsort, 373, 524 
qsortO, 372, 375, 379, 
381,664-665 
qualifier, 17, 307 
question mark \?, 113 
queue, 471 
queue, 471-472, 474 
quicksort, 391, 394 
quicksort(), 372, 392 

R 

\r carriage return, 113 
ragged_array, 292 
raiseO, 650 
randO, 95, 104, 353, 376, 
422,665 
rand(), 94 
RAND_MAX, 101 
randomO, 222 
random-number functions, 
665 

range, 121 

range error, 644, 647 
read(), 681 
real numbers, 121 
real roots, 191 
reallocO, 663-664 
rectangle class (C++), 611 
recursion, 223-232, 452 
redirect input <, 27, 56-57 
redirect output >, 56-57 
register, 77, 205, 216, 
219-220, 651 
relational operators, 
147-151 

release_matri x_ 
space(), 577-578 
remove(), 519, 662 
rename, 55 
renameO, 662 
report_and_tabulate(), 
355 

reserved word, 7, 12 
resetQ, 431, 433 


return, 8, 35, 77, 182, 
200-201 
reverseO, 614 
rewindO, 655-657, 660 
Richards, M., 1 
right shift operator », 332, 
335-336 
Ritchie, D., 1 
rocks _ pape r_scissors, 
349-352, 354-355 
root node, 475 
rules, 534 

run-time error, 54-55 
running_sum, 26-27 

s 

%s format, 494 

s_to_l 0,453 

scalar, 282 

scan field, 501 

scanfO, 18-21,499, 661 

scientific notation, 17, 119 

scope 

block, 213-214 
function, 222 
restriction, 221 
rules, 213 
scores , 37 
sea , 6, 9-10 

searching functions, 664 
sed utility, 540 
selection_by__ma- 
chineO, 352 
selection_by_player(), 
353 

self-referential structure, 
447 

semaphore, 568 
semicolon terminator ;, 8, 
11, 63, 82, 87, 108, 158 
set, 521 
setbuf(), 656 
setjmpC), 649 
setjmp.h , 641, 649 
setlocaleO, 646 
setvbufO, 656 


shell, 580-582, 584 
shift left equal operator 
«=, 88 

shift operator, 335 
shift right equal operator 
»=, 88 

short, 77, 110-111, 
117-118, 131 

short-circuit evaluation, 157 
shuffle(), 423 
side-effect 
fputcC), 659 
getc(), 658 
increment and decre¬ 
ment, 86 

macro evaluation, 658 
SIGJDFL, 564-565 
SIGJCGN, 564-565 
SIGFPE, 564-565 
SICILL, 564 
SIGINT, 564 

signal handling, 564, 650 
signal (), 564-566, 570, 
650 

signalh , 564, 641, 650 
signed, 77, 110, 131, 693 
signed char, 110-111, 115 
signed int, 110 
signed long int,110 
signed short int,110 
SIGSEGV, 564 
sin(), 127, 295, 647 
single quote \', 113 
sinh(), 647 

sizeof, 77, 122-123, 259, 
450 

sleepO, 683 
small ^integer structure, 
428 

sort 

bubble, 38, 257-258 
functions, 664 
merge, 263 
quick sort, 372 
quicksort, 391, 394 
sort, 288, 377, 380-381 
sort.h , 284, 378 
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sort_words , 285, 288-290 
sort_words(), 288 
source file, 5, 534 
spawn...(), 560, 682 
spawnl (), 560, 680 
special characters, 82 
sprintf(),503-504 
sqrtQ, 127, 648 
sqrt_pow , 128 
square class (C++), 612 
srandO, 95, 422, 665 
srand48(), 105 
sscanfO, 503-504, 661 
stack 

as list, 460 
stack , 431-432, 434, 
461-463, 613 
stack class (C++), 613 
stack structure, 431-432, 
461, 466 
standard C, 1, 3 
standard header file, 14 
standard library, 8, 92-93, 
273,641-683 
start_time(),529 
statement, 11 

compound, 22, 157 
empty, 158 
expression, 158-159 
labeled, 178 
terminator ;, 8, 11, 63, 
82, 87, 108, 158 
statements 

break, 77, 179, 182 
case, 77, 181-182, 347, 
694 

continue, 77, 179-180, 
182 

default, 77, 182 
do, 77, 172-173, 367 

C I ^ C j / / 

for, 25, 77,167-169,171 
goto, 77, 178, 182, 649 
if, 21, 77, 159-162 
if-else, 22, 159-160, 
162, 166, 181, 183 


return, 8, 35, 77, 182, 
200-201 

switch, 77, 181, 694 
while, 23-24, 77, 
166-167, 172 

static, 77, 205, 216, 218, 
220-222 

status, 579, 581-582, 584 
stdarg.h , 641, 651, 661 

__STDC_, 387, 700 

stddef.h , 371, 641, 652 
stderr, 503, 654 
stdin, 503, 654 
stdio.h , 7-8, 14, 16, 18, 39, 
92, 124-125, 382, 641, 
653 

stdlib.h, 27, 94, 259, 372, 
482, 641, 663 
stdout, 503, 654 
Steps to be followed in writ¬ 
ing and running a C pro¬ 
gram, 54 

storage allocation, 215 
storage class, 205, 216-222 
storage mapping, 279 
storage types 
auto, 77, 205 
auto 

automatic vari¬ 
ables, 216-217 
extern, 77, 205, 231, 308 
extern 

global, 216-219 
register, 77, 205, 651 
register 

memory register, 
216, 219-220 
static, 77, 205, 216, 
218,220-222 
strcatO, 273, 275-276, 
671 

strchrC), 672 
strcmpO, 273, 672, 675 
strcoll(), 672, 675 
strcpy(),44, 273-274, 672 
strcspnQ, 672 
stream I/O (C++), 595-596 


streams, 654 

stderr, 503, 654 
stdin, 503, 654 
stdout, 503, 654 
strerrorQ, 644, 672 
strftimeC),678-679 
string, 36, 39, 46, 270-276 
constant, 8, 80-81, 270, 
694-695 
control, 12, 18 
conversion functions, 
666 

conversion specification, 
12, 18, 495, 502 
functions, 272-273 
handling functions, 671 
literal, 80 

multibyte functions, 669 
null, 271 
size, 270 

stri ng class (C++), 601, 
603-604, 606 
string.h , 43, 272, 641, 670 
string__to_l i st(), 452 
stringization, 387-388, 700 
strlenO, 273, 673 
strncatC),673 
strncmpO, 673 
strncpyC),673 
strpbrkO, 673 
strrchrO, 673 
strspnC), 674 
strstrO, 674 
strtodO, 666-667 
strtokO, 674 
strtol(),667 
strtoul(),668 
struct, 77, 407-408 
structure, 407-424, 
447-483, 699 
access, 411, 417 
as argument, 416 
initialization, 418, 695 
list, 447-483 
member, 411 
pointer to, 414 
self-referential, 447 
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structures 
abc, 428-429 
card, 40/-409, 419 
complex, 413 
data, 465 
date, 410 
dept, 416 
elem, 465,471 
flower, 426 
fruit, 409, 426 
home_address,418 
Iconv, 645 
linked_Jist, 449 
list, 447 
node, 476, 480 
pcard, 427 
smal l_integer, 428 
stack, 431-432, 461, 
466 

student, 411 
tm, 6/5 

vegetable, 409, 427 
strxfrinC), 675 
student, 411 

student class (C++), 609 
Student class (Java), 

633-634 

student structure, 411 
StudentTest class (Java), 

634 

subscript operator [], 37, 

245-248, 253, 262 
subscript range, 575 
subtree, 475 
sum , 71 

sum(), 223, 256 
sunuarrayO, 261 
sum_square, 294-295 
suspend program functions, 

683 

swapO, 252, 289, 423 
switch, 77, 181, 694 
symbolic constant, 14, 367, 

D O 3 

syntax, 69, 73 
syntax error, 54-55 
syntax summary, 685-692 


system command, 518 
system names, 79 
system tools, 518-540 
systemQ, 518-519, 665 


tab \t, 112-113 

table_of^powers, 203-204 

tables 

Access permissions, 517 
Additional keywords for 
Turbo C, 77 

ASCII abbreviations, 704 
Assignment operators, 

88 

Binary representation, 
343 

Bitwise and two’s com¬ 
plement, 333 
Bitwise operators, 332 
C compiler that gets in¬ 
voked, 522 
C header files, 641 
Calls to sum(), 257 • 
Character constants and 
integer values, 112 
Compiler options, 523 
Conversions to base 2, 8, 

1U, lo, 13 d 

ctype.h calls, 383 
ctype.h macros, 383 
Decimal and hexadeci¬ 
mal 134 

Declarations of arrays, 
277 

Expressions equivalent to 
a[i][j], 278 
File modes, 506 
File names, 514 
Filenames written in C, 
503 

float and unsigned, 119 
Function call for sum(), 
224 

Fundamental data types, 
110 


Fundamental data types 
long form, 110 
Fundamental type sizes, 
142 

Fundamental types by 
functionality, 111 
Hex adec im al numb er 
c onversions, 13 5 
Keywords, 77 
long and unsigned, 119 
Modes for spawn...(), 
560 

Octal digit in the file per- 
mission, 517 
Operating system com- 
mands, 539 

Operator precedence and 
associativity, 84, 148, 
332,415, 705 
Predefined macro,387 
printf () conversion 
character, 19, 494 
Production symbols, 74 
program characters, 70 
Relational, equality, and 
logical operators, 147 
scanf () conversion, 500 
scanf () conversion, 20 
Special characters, 113 
Three modes for a file, 47 
Two-dimensional array, 
278 

Using strftimeQ, 679 
Utilities, 540 
Value of fibonacci, 227 
Values of and and or ex¬ 
pressions, 156 
Values of bitwise opera¬ 
tors, 334 

Values of equality ex¬ 
pressions, 152 
Values of not expres¬ 
sions, 154 

Values of relational ex¬ 
pressions, 150 
tag name, 345-346, 
407-410, 424, 701 
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tan(), 127, 647 
tanh(), 647 
tbLotypowers , 206 
fee command, 53, 55 
template, 345, 408,410,424 
template (C++), 612 
temporary file, 510 
text editor, 5, 53-55 
Thompson, K», 1 
three-dimensional array, 
280 

th row (C++), 614 
time functions, 675 
timeQ, 104,422,528, 530, 
676 

time.h, 351, 422, 641, 675 
timed! library, 528 

_TIME_, 387, 700 

times equal operator 88 
timing code, 528-530 
tlib command, 526 
tm structures, 675 
tmpfileQ, 656, 670 
tmpnamC), 519, 656 
toasen (), 383 
token, 69-70, 81 
tolowerQ, 383, 643, 700 
tools, 91, 518-540 
top(), 432, 462 
top-down design, 204 
touch utility, 538 
toupperC),383, 643, 700 
towers^otyhanoi, 229-231 
traditional C, 1 
vs. ANSI, 693-702 
translation unit, 54 
traversal, 477, 482 
tree, 479-480, 482 
binary, 475-483 
true, 21, 147-148 
truth table, 192 
try (C++), 614 
try_ me, 580 
try_qsort, 374 
Turbo C keywords, 77 
two-dimensional array, 
277-278 


two’s complement, 333-334 

type 

arithmetic, 111 
class (C++), 601 
derived, 245, 424 
derived data, 408 
enumeration, 131, 
345-355, 694 
floating, 17, 111, 119 
function, 198-199 
fundamental, 107-137 
generic pointer, 251, 460 
integral, 111, 116-117, 
123 

Java, 627 
parameter list, 29 
qualifier, 17, 307 
user-defined, 408, 602 
typedef, 77, 122,282,371, 
408,410,651 
types, 693 

char, 77, 110-111, 
114-115 

class (C++), 601 
class (Java), 625,627, 
629 


double, 77, 110-111, 
119, 693 
enum, 77, 694 
float, 77, 110-111, 119 
int, 77, 110-111, 

116-118, 122 
long, 77, 110-111, 

117-119 

long doubl e, 110-111, 
119, 693 

long float, 693 
short, 77, 110-111, 


117-118, 131 
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signed, 77,110,131, 693 
signed char, 110-111, 
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signed int, 110 
signed long int,110 
signed short int,110 
struct, 77, 407-408 
template (C++), 612 


union, 77, 424-425 
unsigned, 77, 110-111, 
117-119, 131 
unsigned char, 
110-111, 115 
unsigned int,110 
unsigned long, 
110-111, 118-119 
unsigned long int, 110 
unsigned short, 
110-111 

unsigned short int, 
110 

void, 77, 198, 202, 693, 
697 

void*, 251, 460 


%u format, 494 

unary operator, 85, 148 

#undef, 384-385 

underscore 78-79 

ungetcO, 660 

union, 424-430, 695, 699 

union, 77, 424-425 

UNIX, 1-2, 5, 53, 55-56, 130 

unlinkQ, 662 

unpackO, 

unsigned, 7/, 110-111, 

117- 119, 131 

unsigned char, 110-111, 

1 1 :> 

unsigned int, 110 
unsigned long, 110-111, 

118- 119 

unsigned long int,110 
unsigned short, 110-111 
unsigned short int, 110 
uppercase, 520 
uppercase letter, 70 
user-defined type, 408, 602 
usual arithmetic conver¬ 
sion, 131 
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utilities, 524-540 
awk, 540 
bison, 540 
cb, 539 
csh, 540 
dbx, 539 
diff 539 
flex, 540 
gdb , 539 
grep, 539 
indent , 539 
/ex, 540 

make , 212, 338, 532-538 
naw/q 540 
perl, 540 
sed, 540 
touch , 538 
wc, 539 
yacc, 540 
utility library, 663 


\v vertical tab, 113 
va_arg(), 651 
va_end(), 651 
va„start(), 651 
va_sum(), 652 
variable, 10, 12, 107, 223, 
248 

aggregate, 407 

arguments, 651 

boolean, 169 

environment, 521-522 

external, 199 

global, 199, 216-219, 221 

identifier, / 01 

Java, 627 

local, 199, 217 

loop, 219 

pointer, 46 

static external, 221 

storage allocation, 215 

storage class, 216-222 

string, 270 

type, 216 

visibility, 215 


vector, 282, 575 
See array, 262 
vegetable structure, 409, 
427 

vertical bar as choice sepa¬ 
rator |, 73-74 
vertical tab \v, 113 
vforkO, 682 
vfprintf (), 661 
vi editor, 54 
vi text editor, 5 
visibility, 221 

void, 77, 198, 202,693,697 
void *,694 
void*, 251,460 
volatile, 77, 307-308, 695 

w 

wait, 568 
waitQ, 5/0 
wc utility, 539 
wchar_t, 3/1 
wcstombsO, 669 
wctombO, 669 
wgcd class (Java), 635 
while, 23-24, 77, 166-167, 
172 

white space, 70, 72-73, 
81-82 

widening, 132-133 
width, 427 
word_cnt(), 272 
write(), 68i 
wrt, 211 
wrtQ, 269 

wrt_array(). 262 

wrt_bkwrds, 225-226 
wrt_info(),211 
wrt_it()» 225 
wrt_words(), 290 


Y 

yacc utility, 540 


zero, 41—42, 44, 113, 148, 
223,270-271 


%x format, 494 



C Interfaces and Implementations 

Techniques for Creating Reusable Software 
David R. Hanson 

Every programmer and software project manager must master tire art of creating 
reusable software modules, which are the building blocks of large, reliable 
applications. Unlike some modem object-oriented languages, C provides little 
linguistic support or motivation for creating reusable application programming 
interfaces (APIs). While most C programmers use APIs and the libraries that 
implement them in almost every application they write, relatively few program¬ 
mers create and disseminate new, widely applicable APIs. C Interfaces and 
Implementations shows how to create reusable APIs using interface-based design, 
a language-independent methodology that separates interfaces from their 
implementations. This methodology is explained by example. The author 
describes in detail twenty-four interfaces and their implementations, providing 
the reader with a thorough understanding of this design approach. 

544 pages • Paperback • ISBN 0-201-49841-3 
http: / / www.awl.coin / cseng / ti ties / 0-20 1 -49841-3/ 
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A Retargetable C Compiler 

Design and Implementation 

Christopher W. Fraser and David R, Hanson 

This book examines the design and implementation of lcc, a production-quality, 
retargetable compiler for the ANSI C programming language designed at 
AT&T Bell Laboratories and Princeton University. The authors' innovative 
approach—a "literate program" that intermingles the text with the source 
code—gives a detailed tour of the code that explains the implementation and 
design decisions reflected in the software. And while most books describe toy 
compilers or focus on isolated pieces of code, the authors provide the entire 
source code for a real compiler, which is available via ftp. Structured as a self- 
study guide that describes the real-world tradeoffs encountered in building a 
production-quality compiler, this book is useful to individuals who work in 
application areas applying or creating language-based tools and techniques. 

592 pages • Hardcover • ISBN 0-8053-1670-1 
h ttp: / / www.awl.com/cseng/ titles/0-8053-1670-1 / 
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C Programming FAQs 

Frequently Asked Questions 
Steve Summit 

Steve Summit furnishes you with answers to some of the most frequently 
asked questions in C. Extensively revised from his popular FAQ list on the 
Internet, more than 400 questions are answered to illustrate key points and 
to provide practical guidelines for programmers. C Programming FAQs is a 
welcomed reference for all C programmers, providing accurate answers, 
insightful explanations, and clarification of fine points, along with numerous 
code examples. 

432 pages • Paperback • ISBN 0-201-84519-9 
http://www.aw4.com/cseng/titles/0-201-84519-9/ 











